NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO HELICOBACTER
PYLORI FOR DIAGNOSTICS AND THERAPEUTICS 



Background of the Invention 
Helicobacter pylori is a gram-negative, S-shaped, microaerophilic bacterium that
was discovered and cultured from a human gastric biopsy specimen. (Warren, J.R. and B.
Marshall, (1983) Lancet 1: 1273-1275; and Marshall et al., (1984) Microbios Lett. 25: 83-88).
H. pylori has been strongly linked to chronic gastritis and duodenal ulcer disease.
(Rathbone et al., (1986) Gut 27: 635-641). Moreover, evidence is accumulating for an
etiologic role of H. pylori in nonulcer dyspepsia, gastric ulcer disease, and gastric
adenocarcinoma. (Blaser, M. J., (1993) Trends Microbiol. 1: 255-260). Transmission of the
bacteria occurs via the oral route, and the risk of infection increases with age. (Taylor,
D.N. and M. J. Blaser, (1991) Epidemiol. Rev. 13: 42-50). H. pylori colonizes the human
gastric mucosa, establishing an infection that usually persists for decades. Infection by
H. pylori is prevalent worldwide. Developed countries have infection rates over 50% of the
adult population, while developing countries have infection rates reaching 90% of the
adults over the age of 20. (Hopkins, R. J. and J. G. Morris (1994) Am. J. Med. 97: 265-277).

The bacterial factors necessary for colonization of the gastric environment, and for
virulence of this pathogen, are poorly understood. Examples of the putative virulence
factors include the following: urease, an enzyme that may play a role in neutralizing gastric
acid pH (Eaton et al., (1991) Infect. Immunol. 59: 2470-2475; Ferrero, R.L. and A. Lee
(1991) Microb. Ecol. Hlth. Dis. 4: 121-134; Labigne et al., (1991) J. Bacteriol. 173:
1920-1931); the bacterial flagellar proteins responsible for motility across the mucous layer
(Hazell et al., (1986) J. Inf. Dis. 153: 658-663; Leying et al., (1992) Mol. Microbiol. 6:
2863-2874; and Haas et al., (1993) Mol. Microbiol. 8: 753-760); VacA, a bacterial toxin
that induces the formation of intracellular vacuoles in epithelial cells (Schmitt, W. and R.
Haas, (1994) Molecular Microbiol. 12(2): 307-319); and several gastric tissue-specific
adhesins (Boren et al., (1993) Science 262: 1892-1895; Evans et al., (1993) J. Bacteriol.
175: 674-683; and Falk et al., (1993) Proc. Natl. Acad. Sci. USA 90: 2035-203).

Numerous therapeutic agents are currently available that eradicate H. pylori
infections in vitro. (Huesca et al., (1993) Zbl. Bakt. 280: 244-252; Hopkins, R. J. and J. G.
Morris, supra). However, many of these treatments are suboptimally effective in vivo
because of bacterial resistance, altered drug distribution, patient non-compliance or poor
drug availability. (Hopkins, R. J. and J. G. Morris, supra). Treatment with antibiotics
combined with bismuth is part of the standard regimen used to treat H. pylori infection.
(Malfertheiner, P. and J. E. Dominguez-Munoz (1993) Clinical Therapeutics 15 Supp. B:
37-48). Recently, combinations of proton pump inhibitors and a single antibiotic have been
shown to ameliorate duodenal ulcer disease. (Malfertheiner, P. and J. E. Dominguez-Munoz,
supra). However, methods employing antibiotic agents can lead to the
emergence of bacterial strains which are resistant to these agents. (Hopkins, R. J. and J. G.
Morris, supra). These limitations demonstrate that new, more effective methods are needed
to combat H. pylori infections in vivo. In particular, the design of new vaccines that may
prevent infection by this bacterium is highly desirable.

Summary of the Invention

This invention relates to novel genes, e.g., genes encoding bacterial surface
proteins, from the organism Helicobacter pylori, and other related genes, their products,
and uses thereof. The nucleic acids and peptides of the present invention have utility for
diagnostics and therapeutics for H. pylori and other Helicobacter species. They can also be
used to detect the presence of H. pylori and other Helicobacter species in a sample, and
to screen compounds for the ability to interfere with the H. pylori life cycle or to
inhibit H. pylori infection. More specifically, this invention features compositions of
nucleic acids corresponding to entire coding sequences of H. pylori surface proteins or parts
thereof, nucleic acids capable of binding mRNA from H. pylori surface proteins to block
protein translation, and methods for producing H. pylori surface proteins or parts thereof
using peptide synthesis and recombinant DNA techniques. This invention also includes
antibodies and nucleic acid sequences useful as probes to detect H. pylori infection. In
addition, vaccine compositions and methods for the protection against infection by
H. pylori are described.


Brief Description of the Drawings
Figure 1 is a table which contains information from homology searches performed
on the sequences of this invention using the BLAST algorithm contained in the Wisconsin
Sequence Analysis Package.

Figure 2 is a bar graph that depicts the antibody titer in serum of mice following
immunization with specific H. pylori antigens.

Figure 3 is a bar graph that depicts the antibody titer in mucus of mice following
immunization with specific H. pylori antigens.

Figure 4 is a bar graph that depicts therapeutic immunization of H. pylori infected
mice with specific antigens dissolved in HEPES buffer.

Figure 5 is a bar graph that depicts therapeutic immunization of H. pylori infected
mice with specific antigens dissolved in buffer containing DOC.






Detailed Description of the Invention 
In one aspect, the invention features a substantially pure nucleic acid encoding an
H. pylori polypeptide having an amino acid sequence of SEQ ID NO:115, such as a nucleic
acid comprising a nucleotide sequence of SEQ ID NO:1.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:116, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:2.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:117, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:3.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:118, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:4.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:119, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:5.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:120, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:6.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:121, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:7.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:122, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:8.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:123, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:9.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:124, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:10.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:125, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:11.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:126, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:12.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:127, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:13.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:128, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:14.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:129, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:15.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:130, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:16.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:131, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:17.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:132, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:18.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:133, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:19.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:134, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:20.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:135, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:21.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:136, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:22.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:137, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:23.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:138, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:24.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:139, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:25.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:140, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:26.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:141, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:27.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:142, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:28.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:143, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:29.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:144, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:30.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:145, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:31.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:146, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:32.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:147, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:33.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:148, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:34.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:149, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:35.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:150, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:36.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:151, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:37.



WO 97/19098

PCT/US96/18542

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:152, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:38.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:153, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:39.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:154, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:40.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:155, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:41.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:156, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:42.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:157, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:43.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:158, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:44.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:159, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:45.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:160, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:46.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:161, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:47.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:162, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:48.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:163, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:49.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:164, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:50.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:165, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:51.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:166, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:52.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:167, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:53.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:168, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:54.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:169, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:55.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:170, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:56.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:171, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:57.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:172, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:58.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:173, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:59.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:174, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:60.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:175, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:61.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:176, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:62.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:177, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:63.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:178, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:64.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:179, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:65.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:180, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:66.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:181, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:67.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:182, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:68.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:183, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:69.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:184, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:70.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:185, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:71.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:186, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:72.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:187, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:73.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:188, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:74.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:189, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:75.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:190, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:76.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:191, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:77.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:192, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:78.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:193, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:79.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:194, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:80.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:195, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:81.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:196, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:82.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:197, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:83.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:198, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:84.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:199, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:85.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:200, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:86.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:201, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:87.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:202, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:88.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:203, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:89.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:204, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:90.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:205, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:91.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:206, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:92.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:207, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:93.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:208, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:94.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:209, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:95.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:210, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:96.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:211, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:97.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:212, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:98.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:213, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:99.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:214, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:100.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:215, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:101.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:216, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:102.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:217, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:103.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:218, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:104.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:219, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:105.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:220, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:106.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:221, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:107.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:222, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:108.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:223, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:109.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:224, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:110.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:225, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:111.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:226, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:112.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:227, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:113.

In another aspect, the invention features a substantially pure nucleic acid encoding
an H. pylori polypeptide having an amino acid sequence of SEQ ID NO:228, such as a
nucleic acid comprising a nucleotide sequence of SEQ ID NO:114.


In another aspect, the invention comprises nucleic acids capable of binding mRNA
of H. pylori. Such a nucleic acid is capable of acting as an antisense nucleic acid to
control the translation of mRNA of H. pylori. A further aspect features a nucleic acid
which is capable of binding specifically to an H. pylori nucleic acid. Such nucleic acids
have utility as probes and as capture reagents.

In another aspect, the invention features an expression system comprising an open
reading frame corresponding to H. pylori nucleic acid. The nucleic acid further comprises a
control sequence compatible with an intended host. The expression system is useful for
making polypeptides corresponding to H. pylori nucleic acid.

In another aspect, the invention features a cell transformed with the expression
system to make H. pylori polypeptides.

In another aspect, the invention features a method of generating antibodies against
H. pylori proteins which are capable of binding specifically to H. pylori proteins. Such
antibodies have utility as reagents for immunoassays to evaluate the abundance and
distribution of H. pylori-specific antigens.

In another aspect, the invention features a method of generating vaccines for
immunizing an individual against H. pylori. The method includes: immunizing a subject
with an H. pylori protein, e.g., a surface protein, or portion thereof, and a pharmaceutically
acceptable carrier. Such vaccines have therapeutic and prophylactic utilities.

In another aspect, the invention provides a method for generating a vaccine
comprising a modified immunogenic H. pylori protein, e.g., a surface protein, or portion
thereof, and a pharmacologically acceptable carrier.

In another aspect, the invention features a method of evaluating a compound, e.g., a
polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H. pylori
polypeptide. The method includes: contacting the candidate compound with an H. pylori
polypeptide and determining if the compound binds or otherwise interacts with an H.
pylori polypeptide. Compounds which bind H. pylori are candidates as activators or
inhibitors of the bacterial life cycle. These assays can be performed in vitro or in vivo.

In another aspect, the invention features a method of evaluating a compound, e.g., a
polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H. pylori
nucleic acid, e.g., DNA or RNA. The method includes: contacting the candidate
compound with an H. pylori nucleic acid and determining if the compound binds or
otherwise interacts with an H. pylori polypeptide. Compounds which bind H. pylori are
candidates as activators or inhibitors of the bacterial life cycle. These assays can be
performed in vitro or in vivo.

The invention features H. pylori polypeptides, preferably a substantially pure
preparation of an H. pylori polypeptide, or a recombinant H. pylori polypeptide. In
preferred embodiments: the polypeptide has biological activity; the polypeptide has an
amino acid sequence at least 60%, 80%, 90%, 95%, 98%, or 99% homologous to an amino
acid sequence contained in SEQ ID NOs:115-228; the polypeptide has an amino acid
sequence essentially the same as an amino acid sequence in SEQ ID NOs:115-228; the
polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the polypeptide
includes at least 5, preferably at least 10, more preferably at least 20, more preferably at
least 50, 100, or 150 contiguous amino acids contained in SEQ ID NOs:115-228.

In preferred embodiments: the H. pylori polypeptide is encoded by a nucleic acid in
SEQ ID NOs:1-114, or by a nucleic acid having at least 60%, 70%, 80%, 90%, 95%, 98%,
or 99% homology with a nucleic acid of SEQ ID NOs:1-114.

In a preferred embodiment, the subject H. pylori polypeptide differs in amino acid
sequence at 1, 2, 3, 5, 10 or more residues from a sequence in SEQ ID NOs:115-228. The
differences, however, are such that: the H. pylori polypeptide exhibits an H. pylori
biological activity, e.g., the H. pylori polypeptide retains a biological activity of a naturally
occurring H. pylori enzyme.

In preferred embodiments, the polypeptide includes all or a fragment of an amino
acid sequence contained in SEQ ID NOs:115-228; fused, in reading frame, to additional
amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic
DNA which encodes a sequence contained in SEQ ID NOs:115-228.

In yet other preferred embodiments, the H. pylori polypeptide is a recombinant
fusion protein having a first H. pylori polypeptide portion and a second polypeptide
portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to H.
pylori. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a




DNA binding domain, or a polymerase activating domain. In a preferred embodiment, the
fusion protein can be used in a two-hybrid assay.

Polypeptides of the invention include those which arise as a result of alternative
transcription events, alternative RNA splicing events, and alternative translational and
posttranslational events.

The invention includes an immunogen which includes an H. pylori polypeptide in
an immunogenic preparation, the immunogen being capable of eliciting an immune
response specific for said H. pylori polypeptide, e.g., a humoral response, an antibody
response, or a cellular response. In preferred embodiments, the immunogen comprises an
antigenic determinant from a protein contained in SEQ ID NOs:115-228.

In another aspect, the invention provides a substantially pure nucleic acid having a
nucleotide sequence which encodes an H. pylori polypeptide. In preferred embodiments:
the encoded polypeptide has biological activity; the encoded polypeptide has an amino acid
sequence at least 60%, 80%, 90%, 95%, 98%, or 99% homologous to an amino acid
sequence contained in SEQ ID NOs:115-228; the encoded polypeptide has an amino acid
sequence essentially the same as an amino acid sequence in SEQ ID NOs:115-228; the
encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded
polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more
preferably at least 50, 100, or 150 contiguous amino acids contained in SEQ ID NOs:115-228.

In preferred embodiments: the nucleic acid is that of SEQ ID NOs:1-114; the
nucleic acid is at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homologous with a
nucleic acid sequence contained in SEQ ID NOs:1-114.

In a preferred embodiment, the encoded H. pylori polypeptide differs in amino acid
sequence at 1, 2, 3, 5, 10 or more residues from a sequence in SEQ ID NOs:115-228. The
differences, however, are such that: the encoded H. pylori polypeptide exhibits an H. pylori
biological activity, e.g., the encoded H. pylori polypeptide retains a biological activity of a
naturally occurring H. pylori enzyme.

In preferred embodiments, the encoded polypeptide includes all or a fragment of an
amino acid sequence contained in SEQ ID NOs:115-228; fused, in reading frame, to
additional amino acid residues, preferably to residues encoded by genomic DNA 5' to the
genomic DNA which encodes a sequence contained in SEQ ID NOs:115-228.

In preferred embodiments, the subject H. pylori nucleic acid will include a
transcriptional regulatory sequence, e.g., at least one of a transcriptional promoter or
transcriptional enhancer sequence, operably linked to the H. pylori gene sequence, e.g., to
render the H. pylori gene sequence suitable for expression in a recombinant host cell.

In yet a further preferred embodiment, the nucleic acid which encodes an H. pylori
polypeptide of the invention hybridizes under stringent conditions to a nucleic acid probe
corresponding to at least 12 consecutive nucleotides contained in SEQ ID NOs:1-114; more
preferably to at least 20 consecutive nucleotides contained in SEQ ID NOs:1-114; more
preferably to at least 40 consecutive nucleotides contained in SEQ ID NOs:1-114.

In a preferred embodiment, the nucleic acid encodes a peptide which differs by at
least one amino acid residue from the sequences shown in SEQ ID NOs:115-228.

In a preferred embodiment, the nucleic acid differs by at least one nucleotide from a
nucleotide sequence shown in SEQ ID NOs:1-114 which encodes amino acids shown in
SEQ ID NOs:115-228.

In another aspect, the invention includes: a vector including a nucleic acid which
encodes an H. pylori-like polypeptide, e.g., an H. pylori polypeptide; a host cell transfected
with the vector; and a method of producing a recombinant H. pylori-like polypeptide, e.g.,
an H. pylori polypeptide; including culturing the cell, e.g., in a cell culture medium, and
isolating the H. pylori-like polypeptide, e.g., an H. pylori polypeptide, e.g., from the cell or
from the cell culture medium.

In another aspect, the invention features a purified recombinant nucleic acid having
at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a sequence
contained in SEQ ID NOs:1-114.

The invention also provides a probe or primer which includes a substantially
purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence
which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense
or antisense sequence contained in SEQ ID NOs:1-114, or naturally occurring mutants
thereof. In preferred embodiments, the probe or primer further includes a label group
attached thereto. The label group can be, e.g., a radioisotope, a fluorescent compound, an
enzyme, and/or an enzyme co-factor. Preferably the oligonucleotide is at least 10 and less
than 20, 30, 50, 100, or 150 nucleotides in length.

The invention involves nucleic acids, e.g., RNA or DNA, encoding a polypeptide of
the invention. This includes double stranded nucleic acids as well as coding and antisense
single strands.

The H. pylori strain from which genomic sequences have been sequenced has been
deposited in the American Type Culture Collection (ATCC # 55679) as strain HP-J99.

The nucleic acid sequences of this invention may be obtained directly from the
DNA of the above referenced H. pylori strain by using the polymerase chain reaction
(PCR). See "PCR, A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press,
Oxford, UK, 1991) for details about the PCR. High fidelity PCR can be used to ensure a
faithful DNA copy prior to expression. In addition, amplified products can be checked by
conventional sequencing methods. Clones carrying the desired sequences described in this
invention may be obtained by screening the libraries by means of the PCR or by
hybridization of synthetic oligonucleotide probes to filter lifts of the library colonies or
plaques as known in the art (see, e.g., Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd edition, 1989, Cold Spring Harbor Press, NY).

Nucleic acids isolated or synthesized in accordance with features of the present
invention are useful, by way of example, without limitation, as probes, primers, capture
ligands, antisense genes and for developing expression systems for the synthesis of proteins
and peptides corresponding to such sequences.

As probes, primers, capture ligands and antisense agents, the nucleic acid will 
normally comprise approximately twenty or more nucleotides for specificity as well as the 
ability to form stable hybridization products. 

Putative functions have been determined for several of the H. pylori polypeptides of
the invention, as shown in Figure 1.

Accordingly, uses of the claimed H. pylori polypeptides in these identified functions 
are also within the scope of the invention. 

In addition, the present invention encompasses H. pylori polypeptides characterized
as shown in Table 1 below, including: H. pylori outer membrane proteins, H. pylori
periplasmic/secreted proteins, and other H. pylori surface proteins. Members of these
groups were identified by BLAST homology searches. The H. pylori polypeptides
identified in Table 1 are representative members of the groups identified above and are in
no way limiting. Additional members of the groups can be identified within the H. pylori
polypeptides disclosed herein by the methods known to those skilled in the art.



TABLE 1

SEQ ID NO                    Blast identifier   Gene Symbol/Name   Description

Outer Membrane Proteins
7116626 (SEQ ID NO:223)      P26093             e(P4)              e (P4) lipoprotein attached by lipid in H. influenzae
29479681 (SEQ ID NO:179)     P13036             fecA               Receptor in Iron (III) dicitrate transport E. coli
36126938 (SEQ ID NO:199)     L12346             copB               Major out. memb. prot. in M. catarrhalis

Periplasmic/Secreted Proteins
30100332 (SEQ ID NO:181)     P23847             dppA               Periplasmic dipeptide binding protein in E. coli

Other Surface Proteins
4821082 (SEQ ID NO:212)      P08089             M protein          M protein of group A Streptococci
978477 (SEQ ID NO:228)       L28919             FBP54              Surface Ag of grp A. Streptococci binds fibronectin


Definitions 

A purified preparation or a substantially pure preparation of a polypeptide, as used
herein, means a polypeptide that has been separated from other proteins, lipids, and nucleic
acids with which it naturally occurs. Preferably, the polypeptide is also separated from
substances, e.g., antibodies or gel matrix, e.g., polyacrylamide, which are used to purify it.
Preferably, the polypeptide constitutes at least 10, 20, 50, 70, 80 or 95% dry weight of the
purified preparation. Preferably, the preparation contains: sufficient polypeptide to allow
protein sequencing; at least 1, 10, or 100 µg of the polypeptide; at least 1, 10, or 100 mg of
the polypeptide.

A purified preparation of cells refers to, in the case of plant or animal cells, an in 
vitro preparation of cells and not an entire intact plant or animal. In the case of cultured 
cells or microbial cells, it consists of a preparation of at least 10% and more preferably 50% 
of the subject cells. 

The metabolism of a substance, as used herein, means any aspect of the expression,
function, action, or regulation of the substance. The metabolism of a substance includes
modifications, e.g., covalent or non-covalent modifications, of the substance. The
metabolism of a substance includes modifications, e.g., covalent or non-covalent
modifications, that the substance induces in other substances. The metabolism of a substance
also includes changes in the distribution of the substance. The metabolism of a substance
includes changes the substance induces in the distribution of other substances.

A substantially pure nucleic acid, e.g., a substantially pure DNA, is a nucleic acid
which is one or both of: not immediately contiguous with both of the coding sequences
with which it is immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the
naturally-occurring genome of the organism from which the nucleic acid is derived; or
which is substantially free of a nucleic acid sequence with which it occurs in the organism
from which the nucleic acid is derived. The term includes, for example, a recombinant
DNA which is incorporated into a vector, e.g., into an autonomously replicating plasmid or
virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate
molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or restriction
endonuclease treatment) independent of other DNA sequences. Substantially pure DNA
also includes a recombinant DNA which is part of a hybrid gene encoding additional H.
pylori DNA sequence.

Homologous refers to the sequence similarity or sequence identity between two
polypeptide molecules or between two nucleic acid molecules. When a position in both of
the two compared sequences is occupied by the same base or amino acid monomer subunit,
e.g., if a position in each of two DNA molecules is occupied by adenine, then the molecules
are homologous at that position. The percent of homology between two sequences is a
function of the number of matching or homologous positions shared by the two sequences
divided by the number of positions compared x 100. For example, if 6 of 10 of the
positions in two sequences are matched or homologous then the two sequences are 60%
homologous. By way of example, the DNA sequences ATTGCC and TATGGC share 50%
homology. Generally, a comparison is made when two sequences are aligned to give
maximum homology.
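
By way of illustration only, the percent homology calculation defined above can be sketched in Python. The function name percent_homology is hypothetical and is not part of the disclosure, and the sketch assumes that the two sequences have already been aligned to equal length; it does not perform the alignment itself.

```python
def percent_homology(seq_a: str, seq_b: str) -> float:
    """Matching positions divided by positions compared, times 100.

    Assumption: seq_a and seq_b are already aligned and of equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions occupied by the same base or amino acid.
    matches = sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a == b)
    return 100.0 * matches / len(seq_a)


# The example from the text: ATTGCC and TATGGC match at 3 of 6 positions.
print(percent_homology("ATTGCC", "TATGGC"))  # 50.0
```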

The terms peptides, proteins, and polypeptides are used interchangeably herein. 
As used herein, the term "surface protein" refers to all surface accessible proteins,
e.g., inner and outer membrane proteins, proteins adhering to the cell wall, and secreted
proteins.

As used herein, the term "transgene" means a nucleic acid sequence (encoding, e.g.,
one or more polypeptides), which is partly or entirely heterologous, i.e., foreign, to the
transgenic animal or cell into which it is introduced, or is homologous to an endogenous
gene of the transgenic animal or cell into which it is introduced, but which is designed to be
inserted, or is inserted, into the cell's genome in such a way as to alter the genome of the
cell into which it is inserted (e.g., it is inserted at a location which differs from that of the
natural gene or its insertion results in a knockout). A transgene can include one or more
transcriptional regulatory sequences and any other nucleic acid, such as introns, that may be
necessary for optimal expression of the selected nucleic acid, all operably linked to the
selected nucleic acid, and may include an enhancer sequence.

As used herein, the term "transgenic cell" refers to a cell containing a transgene.
As used herein, a "transgenic animal" is any animal in which one or more, and
preferably essentially all, of the cells of the animal includes a transgene. The transgene can
be introduced into the cell, directly or indirectly by introduction into a precursor of the cell,
by way of deliberate genetic manipulation, such as by microinjection or by infection with a
recombinant virus. This molecule may be integrated within a chromosome, or it may be
extrachromosomally replicating DNA.

As used herein, the term "cell-specific promoter" means a DNA sequence that
serves as a promoter, i.e., regulates expression of a selected DNA sequence operably
linked to the promoter, and which effects expression of the selected DNA sequence in
specific cells of a tissue. The term also covers so-called "leaky" promoters, which
regulate expression of a selected DNA primarily in one tissue, but cause expression in
other tissues as well.

A polypeptide has H. pylori biological activity if it has one, two, three, and
preferably more of the following properties: (1) when expressed in the course of an H.
pylori infection, it can promote or mediate the attachment of H. pylori to a cell; (2) it has an
enzymatic activity characteristic of an H. pylori protein; or (3) the gene which encodes it
can rescue a lethal mutation in an H. pylori gene. A polypeptide has biological activity if it
is an antagonist, agonist, or super-agonist of a polypeptide having one of the above-listed
properties.

Misexpression, as used herein, refers to a non-wild type pattern of gene expression.
It includes: expression at non-wild type levels, i.e., over or under expression; a pattern of
expression that differs from wild type in terms of the time or stage at which the gene is
expressed, e.g., increased or decreased expression (as compared with wild type) at a
predetermined developmental period or stage; a pattern of expression that differs from wild
type in terms of decreased expression (as compared with wild type) in a predetermined cell
type or tissue type; a pattern of expression that differs from wild type in terms of the
splicing size, amino acid sequence, post-translational modification, or biological activity of
the expressed polypeptide; a pattern of expression that differs from wild type in terms of
the effect of an environmental stimulus or extracellular stimulus on expression of the gene,
e.g., a pattern of increased or decreased expression (as compared with wild type) in the
presence of an increase or decrease in the strength of the stimulus.

As used herein, "host cells" and other such terms denoting microorganisms or
higher eukaryotic cell lines cultured as unicellular entities refer to cells which can become
or have been used as recipients for a recombinant vector or other transfer DNA, and include
the progeny of the original cell which has been transfected. It is understood by individuals
skilled in the art that the progeny of a single parental cell may not necessarily be
completely identical in genomic or total DNA complement to the original parent, due to
accidental or deliberate mutation.

As used herein, the term "control sequence" refers to a nucleic acid having a base
sequence which is recognized by the host organism to effect the expression of encoded
sequences to which it is ligated. The nature of such control sequences differs depending
upon the host organism; in prokaryotes, such control sequences generally include a
promoter, ribosomal binding site and terminators; in eukaryotes, generally such control
sequences include promoters, terminators and in some instances, enhancers. The term
control sequence is intended to include, at a minimum, all components whose presence is
necessary for expression, and may also include additional components whose presence is
advantageous, for example, leader sequences.

As used herein, the term "operably linked" refers to sequences joined or ligated to 
function in their intended manner. For example, a control sequence is operably linked to 
coding sequence by ligation in such a way that expression of the coding sequence is 
achieved under conditions compatible with the control sequence and host cell. 

An "open reading frame", also referred to herein as ORF, is a region of nucleic acid
which encodes a peptide. This region may represent a portion of a coding sequence or a
total sequence.

As used herein, a "coding sequence" is a nucleic acid sequence which is transcribed
into messenger RNA and/or translated into a peptide when placed under the control of
appropriate regulatory sequences. The boundaries of the coding sequence are determined by
a translation start codon at the five prime terminus and a translation stop codon at the three
prime terminus. A coding sequence can include but is not limited to messenger RNA,
synthetic DNA, or recombinant nucleic acid sequences.
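
The boundaries described above can be illustrated with a minimal Python sketch that scans one strand of a DNA sequence for regions running from a start codon to the next in-frame stop codon. The function name find_orfs and the min_codons parameter are hypothetical. The sketch assumes ATG as the only start codon and TAA, TAG, and TGA as stop codons; bacteria also use other start codons, which are ignored here for simplicity.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna: str, min_codons: int = 2):
    """Return (start, end, frame) tuples for ATG...stop regions on one strand.

    start is the index of the A of ATG; end is one past the stop codon.
    min_codons counts codons from ATG up to, but not including, the stop.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # translation start codon at the 5' boundary
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # resume scanning after the stop codon
    return orfs


sequence = "CCATGAAATTTTAGGG"
for start, end, frame in find_orfs(sequence):
    print(frame, sequence[start:end])
```

A fuller treatment would also scan the reverse complement strand, since a coding sequence may lie on either strand.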

A "gene product" is a protein or structural RNA which is specifically encoded for 
by a gene. 

As used herein, the term "probe" refers to a nucleic acid, peptide or other chemical
entity which specifically binds to a molecule of interest. Probes are often associated with or
capable of associating with a label. A label is a chemical moiety capable of detection.
Typical labels comprise dyes, radioisotopes, luminescent and chemiluminescent moieties,
fluorophores, enzymes, precipitating agents, amplification sequences, and the like.
Similarly, a nucleic acid, peptide or other chemical entity which specifically binds to a
molecule of interest and immobilizes such molecule is referred to herein as a "capture ligand".
Capture ligands are typically associated with or capable of associating with a support such
as nitro-cellulose, glass, nylon membranes, beads, particles and the like. The specificity of
hybridization is dependent on conditions such as the base pair composition of the
nucleotides, and the temperature and salt concentration of the reaction. These conditions are
readily discernible to one of ordinary skill in the art using routine experimentation.

The experimental manipulation of such conditions has been well described in the
literature, including such books as Molecular Cloning: A Laboratory Manual, Sambrook, J.,
Fritsch, E.F., Maniatis, T., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 2nd
ed. (1989).
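
As a rough illustration of how base composition bears on hybridization temperature, the following Python sketch applies the widely used Wallace rule of thumb, Tm = 2(A+T) + 4(G+C) in degrees Celsius, to a short oligonucleotide. The rule is not taken from this specification; it is a coarse approximation intended for short oligonucleotides, and it ignores salt concentration, so it is offered only as a starting point for the routine experimentation mentioned above. The function name wallace_tm is hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate melting temperature (deg C) with the Wallace rule.

    Assumption: a short DNA oligonucleotide containing only A, C, G, T.
    """
    oligo = oligo.upper()
    at = oligo.count("A") + oligo.count("T")
    gc = oligo.count("G") + oligo.count("C")
    if not oligo or at + gc != len(oligo):
        raise ValueError("oligo must be a non-empty string of A, C, G, T")
    # G:C pairs contribute more to duplex stability than A:T pairs.
    return 2 * at + 4 * gc


print(wallace_tm("ATGCATGCATGCATGCATGC"))  # 60
```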

The practice of the present invention will employ, unless otherwise indicated,
conventional techniques of chemistry, molecular biology, microbiology, recombinant
DNA, and immunology, which are within the skill of the art. Such techniques are
explained fully in the literature. See, e.g., Sambrook, Fritsch, and Maniatis, Molecular
Cloning: A Laboratory Manual, 2nd ed. (1989); DNA Cloning, Volumes I and II (D.N. Glover
ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed. 1984); Nucleic Acid Hybridization
(B.D. Hames & S.J. Higgins eds. 1984); the series, Methods in Enzymology (Academic
Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and Grossman, eds.) and PCR, A
Practical Approach (McPherson, Quirke, and Taylor, eds., 1991).

Probes

A nucleic acid isolated or synthesized in accordance with SEQ ID NOs:1-114 can
be used as a probe to specifically detect H. pylori. With the sequence information set forth
in the present application, sequences of twenty or more nucleotides are identified which
provide the desired inclusivity and exclusivity with respect to H. pylori, and extraneous
nucleic acid sequences likely to be encountered during hybridization conditions. More
preferably, the sequence will comprise at least twenty to thirty nucleotides to convey
stability to the hybridization product formed between the probe and the intended target
molecules.
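
One simplified way to picture the selection of candidate probe sequences is sketched below in Python. The function name candidate_probes is hypothetical, and the sketch makes a strong assumption: exclusivity is approximated by the absence of an exact match in a set of non-target sequences, which is only a crude stand-in for actual hybridization behavior.

```python
from typing import List


def candidate_probes(target: str, non_targets: List[str], length: int = 20) -> List[str]:
    """List windows of the target that do not occur in any non-target sequence.

    Assumption: exact substring matching approximates cross-hybridization.
    """
    target = target.upper()
    background = [s.upper() for s in non_targets]
    probes: List[str] = []
    for i in range(len(target) - length + 1):
        window = target[i:i + length]
        found_elsewhere = any(window in other for other in background)
        if not found_elsewhere and window not in probes:
            probes.append(window)
    return probes


# Short windows are used here only to keep the example readable.
print(candidate_probes("ACGTTGCA", ["CGTT"], length=4))
```

In practice the candidate windows would also be compared against the complementary strands of the non-target sequences and tested experimentally.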

Sequences larger than 1000 nucleotides in length are difficult to synthesize but can
be generated by recombinant DNA techniques. Individuals skilled in the art will readily
recognize that the nucleic acid sequences, for use as probes, can be provided with a label to
facilitate detection of a hybridization product.

Nucleic acid isolated and synthesized in accordance with SEQ ID NOs:1-114 may
also be useful as probes to detect homologous regions (especially homologous genes) of
other Helicobacter species using relaxed stringency hybridization conditions, as will be
obvious to anybody skilled in the art.

Capture Ligand

For use as a capture ligand, the nucleic acid selected in the manner described above
with respect to probes can be readily associated with a support. The manner in which
nucleic acid is associated with supports is well known. Nucleic acid having twenty or more
nucleotides in a sequence contained in SEQ ID NOs:1-114 has utility to separate H. pylori
nucleic acids from each other and from the nucleic acid of other organisms. Nucleic acid having
twenty or more nucleotides in a sequence shown in SEQ ID NOs:1-114 may also have
utility to separate other Helicobacter species from each other and from other organisms.
Preferably, the sequence will comprise at least twenty nucleotides to convey stability to the
hybridization product formed between the probe and the intended target molecules.
Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be
generated by recombinant DNA techniques.

Primers 

Nucleic acid isolated or synthesized in accordance with the sequences described
herein has utility as primers for the amplification of H. pylori nucleic acid. These nucleic
acids may also have utility as primers for the amplification of nucleic acid sequences in
other Helicobacter species. With respect to polymerase chain reaction (PCR) techniques,
nucleic acid sequences of > 10-15 nucleotides contained in SEQ ID NOs:1-114 have utility
in conjunction with suitable enzymes and reagents to create copies of H. pylori nucleic
acid. More preferably, the sequence will comprise twenty or more nucleotides to convey
stability to the hybridization product formed between the primer and the intended target
molecules. Binding conditions of primers greater than 100 nucleotides are more difficult to
control to obtain specificity. High fidelity PCR can be used to ensure a faithful DNA copy
prior to expression. In addition, amplified products can be checked by conventional
sequencing methods.

The copies can be used in diagnostic assays to detect specific sequences, including
genes from H. pylori and/or other Helicobacter species. The copies can also be
incorporated into cloning and expression vectors to generate polypeptides corresponding to
the nucleic acid synthesized by PCR, as will be described in greater detail below.
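
The relationship between a primer pair and the copy it produces can be sketched in Python as a simple in silico amplification. The names reverse_complement and amplicon are hypothetical, and the sketch assumes perfect, exact primer matches on a single linear template; real primer binding tolerates mismatches and depends on the reaction conditions.

```python
from typing import Optional

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon(template: str, forward: str, reverse: str) -> Optional[str]:
    """Return the top-strand product bounded by the two primers, or None.

    Assumption: both primers match the template exactly. The reverse
    primer anneals to the top strand, so its reverse complement is what
    appears in the top-strand sequence.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(site)]


template = "GGG" + "ACGTACGT" + "TTTTTT" + "CCATGGAA" + "CCC"
print(amplicon(template, "ACGTACGT", reverse_complement("CCATGGAA")))
```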

Antisense

Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in
accordance with the sequences described herein have utility as antisense agents to prevent
the expression of H. pylori genes. These sequences may also have utility as antisense
agents to prevent expression of genes of other Helicobacter species.
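
For illustration, an antisense DNA oligonucleotide complementary to a chosen region of a messenger RNA can be derived as sketched below in Python. The function name antisense_oligo and its parameters are hypothetical, and the choice of target region is left to the user; nothing here predicts whether a given oligonucleotide will be effective.

```python
def antisense_oligo(mrna: str, start: int, length: int = 20) -> str:
    """Return a DNA oligo (5' to 3') complementary to mrna[start:start+length].

    Assumption: the input uses the RNA alphabet A, C, G, U; any T is read as U.
    """
    region = mrna.upper().replace("T", "U")[start:start + length]
    if len(region) < length:
        raise ValueError("target region extends past the end of the mRNA")
    pairs = {"A": "T", "U": "A", "G": "C", "C": "G"}
    # Reverse the region so the result reads 5' to 3', then pair each base.
    return "".join(pairs[base] for base in reversed(region))


print(antisense_oligo("AUGGCU", 0, length=6))  # AGCCAT
```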

Nucleic acid or derivatives corresponding to H. pylori nucleic acid sequences are
loaded into a suitable carrier such as a liposome or bacteriophage for introduction into
bacterial cells. For example, a nucleic acid having twenty or more nucleotides is capable of
binding to bacterial nucleic acid or bacterial messenger RNA. Preferably, the antisense
nucleic acid is comprised of 20 or more nucleotides to provide necessary stability of a
hybridization product of non-naturally occurring nucleic acid and bacterial nucleic acid
and/or bacterial messenger RNA. Nucleic acid having a sequence greater than 1000
nucleotides in length is difficult to synthesize but can be generated by recombinant DNA
techniques. Methods for loading antisense nucleic acid in liposomes are known in the art, as
exemplified by U.S. Patent 4,241,046 issued December 23, 1980 to Papahadjopoulos et al.


Expressing H. pylori Genes 

The function of a specific gene or operon can be ascertained by expression in a 
bacterial strain under conditions where the activity of the gene product(s) specified by the 
gene or operon in question can be specifically measured. Alternatively, a gene product may
be produced in large quantities in an expressing strain for use as an antigen, as an industrial
reagent, for structural studies, etc. This expression could be accomplished in a mutant
strain which lacks the activity of the gene to be tested, or in a strain that does not produce
the same gene product(s). Such strains include, but are not limited to, other Helicobacter
strains and other bacterial strains such as E. coli, Nocardia, Corynebacterium, and Streptomyces
species. In some cases the expression host will utilize the natural Helicobacter promoter,
whereas in others it will be necessary to drive the gene with a promoter sequence derived
from the expressing organism (e.g., an E. coli beta-galactosidase promoter for expression in
E. coli).

To express a gene product using the natural H. pylori promoter, a procedure such as
the following is used. A restriction fragment containing the gene of interest, together with
its associated natural promoter element and regulatory sequences (identified using the DNA
sequence data), is cloned into an appropriate recombinant plasmid containing the following
components: an origin of replication that functions in the host organism, and an appropriate
selectable marker. This can be accomplished by a number of procedures known to those
skilled in the art. It is most preferably done by cutting the plasmid and the fragment to be
cloned with the same restriction enzyme to produce compatible ends that can be ligated to
join the two pieces together. The recombinant plasmid is introduced into the host organism
by electroporation, and cells containing the recombinant plasmid are identified by selection
for the marker on the plasmid. Expression of the desired gene product is detected using an
assay specific for that gene product.

In the case of a gene that requires a different promoter, the body of the gene (coding
sequence) is specifically excised and cloned into an appropriate expression plasmid. This
subcloning can be done by several methods, but is most easily accomplished by PCR
amplification of a specific fragment and ligation into an expression plasmid after treating
the PCR product with a restriction enzyme or exonuclease to create suitable ends for
cloning.
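
One common way to obtain suitable ends for such PCR subcloning is to append restriction sites to the 5' ends of the primers. The sketch below is illustrative only: the function names, the 20-nucleotide annealing length, and the short "GCGC" 5' clamp are assumptions rather than values from the text, and it assumes palindromic recognition sites such as EcoRI (GAATTC) and BamHI (GGATCC).

```python
# Illustrative sketch: names, the 20-nt annealing length and the "GCGC" clamp
# are assumptions. Assumes palindromic recognition sites (e.g. EcoRI, BamHI),
# so searching one strand is sufficient.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

ECORI = "GAATTC"
BAMHI = "GGATCC"


def reverse_complement(dna: str) -> str:
    return dna.upper().translate(_COMPLEMENT)[::-1]


def has_internal_site(coding_seq: str, site: str) -> bool:
    """True if the recognition site occurs inside the fragment to be cloned."""
    return site.upper() in coding_seq.upper()


def tailed_primers(coding_seq, fwd_site, rev_site, anneal=20, clamp="GCGC"):
    """Return (forward, reverse) primers, 5'->3', with restriction-site tails."""
    seq = coding_seq.upper()
    if len(seq) < anneal:
        raise ValueError("coding sequence is shorter than the annealing length")
    for site in (fwd_site, rev_site):
        if has_internal_site(seq, site):
            raise ValueError(f"site {site} also occurs inside the coding sequence")
    forward = clamp + fwd_site.upper() + seq[:anneal]
    reverse = clamp + rev_site.upper() + reverse_complement(seq[-anneal:])
    return forward, reverse


if __name__ == "__main__":
    cds = "ATGAAACGTCTGGCTGAACTGAAAGGTTAA"  # made-up coding sequence
    fwd, rev = tailed_primers(cds, ECORI, BAMHI)
    print(fwd)
    print(rev)
```

The internal-site check matters because digesting the PCR product with an enzyme that also cuts within the coding sequence would fragment the insert.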

Expressed Genes in Therapeutics

Nucleic acid isolated or synthesized in accordance with the sequences described
herein has utility to generate proteins and peptides. The nucleic acid exemplified in SEQ
ID NOs:1-114 or fragments of said nucleic acid sequences encoding immunogenic portions
of H. pylori proteins (SEQ ID NOs:115-228) can be cloned into suitable vectors or used to
isolate nucleic acid. The isolated nucleic acid is combined with suitable DNA linkers and
cloned into a suitable vector.

The host cell may be any procaryotic or eucaryotic cell. For example, an H. pylori 
peptide may be expressed in bacterial cells such as E. coli, insect cells (baculovirus), yeast, 
or mammalian cells such as Chinese hamster ovary (CHO) cells. Other suitable host cells
are known to those skilled in the art.

Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead to 
partial or complete glycosylation and/or formation of relevant inter- or intra-chain disulfide 
bonds of a recombinant peptide product. Examples of vectors for expression in the yeast
S. cerevisiae include pYepSec1 (Baldari et al., (1987) EMBO J. 6:229-234), pMFa (Kurjan
and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et al., (1987) Gene 54:113-123),
and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovirus vectors available
for expression of proteins in cultured insect cells (SF 9 cells) include the pAc series (Smith
et al., (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow, V.A., and
Summers, M.D., (1989) Virology 170:31-39). Generally, COS cells (Gluzman, Y., (1981)
Cell 23:175-182) are used in conjunction with such vectors as pCDM8 (Aruffo, A. and
Seed, B., (1987) Proc. Natl. Acad. Sci. USA 84:8573-8577) for transient
amplification/expression in mammalian cells, while CHO (dhfr- Chinese Hamster Ovary)
cells are used with vectors such as pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195)
for stable amplification/expression in mammalian cells. Vector DNA can be introduced
into mammalian cells via conventional techniques such as calcium phosphate or calcium
chloride co-precipitation, DEAE-dextran-mediated transfection, or electroporation.
Suitable methods for transforming host cells can be found in Sambrook et al. (Molecular
Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press
(1989)), and other laboratory textbooks.

Expression in procaryotes is most often carried out in E. coli with either fusion or 
non-fusion inducible expression vectors. Fusion vectors usually add a number of NH2
terminal amino acids to the expressed target gene. These NH2 terminal amino acids often
are referred to as a reporter group. Such reporter groups usually serve two purposes: 1) to
increase the solubility of the target recombinant protein; and 2) to aid in the purification of
the target recombinant protein by acting as a ligand in affinity purification. Often, in fusion
expression vectors, a proteolytic cleavage site is introduced at the junction of the reporter
group and the target recombinant protein to enable separation of the target recombinant
protein from the reporter group subsequent to purification of the fusion protein. Such
enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and
enterokinase. Typical fusion expression vectors include pGEX (Amrad Corp., Melbourne,
Australia), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia,
Piscataway, NJ), which fuse glutathione S-transferase, maltose E binding protein, or protein
A, respectively, to the target recombinant protein. A preferred reporter group is poly(His),
which may be fused to the amino or carboxy terminus of the protein and which renders the
recombinant fusion protein easily purifiable by metal chelate chromatography.

Inducible non-fusion expression vectors include pTrc (Amann et al., (1988) Gene 
69:301-315) and pET11d (Studier et al., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 60-89). While target gene
expression relies on host RNA polymerase transcription from the hybrid trp-lac fusion
promoter in pTrc, expression of target genes inserted into pET11d relies on transcription
from the T7 gn10-lac 0 fusion promoter mediated by coexpressed viral RNA polymerase
(T7 gn1). This viral polymerase is supplied by host strains BL21(DE3) or HMS174(DE3)
from a resident λ prophage harboring a T7 gn1 under the transcriptional control of the
lacUV5 promoter.

For example, a host cell transfected with a nucleic acid vector directing expression 
of a nucleotide sequence encoding an H. pylori peptide can be cultured under appropriate 
conditions to allow expression of the peptide to occur. The peptide may be secreted and 
isolated from a mixture of cells and medium containing the peptide. Alternatively, the
peptide may be retained cytoplasmically and the cells harvested, lysed and the protein
isolated. A cell culture includes host cells, media and other byproducts. Suitable media for
cell culture are well known in the art. Peptides of the invention can be isolated from cell
culture medium, host cells, or both using techniques known in the art for purifying proteins
including ion-exchange chromatography, gel filtration chromatography, ultrafiltration,
electrophoresis, and immunoaffinity purification with antibodies specific for such peptides.
Additionally, in many situations, peptides can be produced by chemical cleavage of a
native protein (e.g., tryptic digestion) and the cleavage products can then be purified by
standard techniques.

In the case of membrane bound proteins, these can be isolated from a host cell by 
contacting a membrane-associated protein fraction with a detergent forming a solubilized 
complex, where the membrane-associated protein is no longer entirely embedded in the
membrane fraction and is solubilized at least to an extent which allows it to be
chromatographically isolated from the membrane fraction. Several different criteria are
used for choosing a detergent suitable for solubilizing these complexes. For example, one
property considered is the ability of the detergent to solubilize the H. pylori protein within
the membrane fraction with minimal denaturation of the membrane-associated protein,
allowing for the activity or functionality of the membrane-associated protein to return upon
reconstitution of the protein. Another property considered when selecting the detergent is
the critical micelle concentration (CMC) of the detergent, in that the detergent of choice
preferably has a high CMC value allowing for ease of removal after reconstitution. A third
property considered when selecting a detergent is the hydrophobicity of the detergent.
Typically, membrane-associated proteins are very hydrophobic and therefore detergents
which are also hydrophobic, e.g., the Triton series, would be useful for solubilizing the
hydrophobic proteins. Another property important to a detergent can be the capability of
the detergent to remove the H. pylori protein with minimal protein-protein interaction,
facilitating further purification. A fifth property of the detergent which should be
considered is the charge of the detergent. For example, if it is desired to use ion exchange
resins in the purification process, then preferably the detergent should be an uncharged
detergent. Chromatographic techniques which can be used in the final purification step are
known in the art and include hydrophobic interaction, lectin affinity, ion exchange, dye
affinity and immunoaffinity.

One strategy to maximize recombinant H. pylori peptide expression in E. coli is to
express the protein in a host bacterium with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another
strategy would be to alter the nucleic acid encoding the H. pylori peptide to be inserted into an
expression vector so that the individual codons for each amino acid would be those
preferentially utilized in highly expressed E. coli proteins (Wada et al., (1992) Nuc. Acids
Res. 20:2111-2118). Such alteration of nucleic acids of the invention can be carried out by
standard DNA synthesis techniques.
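
The codon-replacement strategy can be sketched as a synonymous recoding step. In the example below the standard genetic code is built from its conventional TCAG-ordered amino acid string; the function names are hypothetical, and the small preference table is a placeholder assumption for illustration, not data from Wada et al. A real application would substitute measured codon usage for the chosen host.

```python
# Illustrative sketch: names are hypothetical and PLACEHOLDER_PREFERENCE is an
# assumed example table, not codon usage data from the cited reference.
from itertools import product

_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code: codons enumerated TTT, TTC, TTA, TTG, TCT, ...
GENETIC_CODE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}

PLACEHOLDER_PREFERENCE = {"L": "CTG", "R": "CGT", "K": "AAA"}


def translate(cds: str) -> str:
    """Translate a coding sequence whose length is a multiple of three."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(GENETIC_CODE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def recode(cds: str, preferred: dict) -> str:
    """Swap each codon for the host-preferred synonymous codon, if one is given."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("coding sequence length must be a multiple of 3")
    for aa, codon in preferred.items():
        if GENETIC_CODE.get(codon) != aa:
            raise ValueError(f"{codon} does not encode {aa}")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        out.append(preferred.get(GENETIC_CODE[codon], codon))
    return "".join(out)


if __name__ == "__main__":
    original = "ATGTTAAGAAAGTAA"  # made-up sequence: M L R K stop
    recoded = recode(original, PLACEHOLDER_PREFERENCE)
    assert translate(recoded) == translate(original)  # protein is unchanged
    print(recoded)
```

Checking that the translation is unchanged after recoding guards against a preference table that maps an amino acid to a non-synonymous codon.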

The nucleic acids of the invention can also be chemically synthesized using 
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides
are known, including solid-phase synthesis which, like peptide synthesis, has been fully
automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S. Patent
No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent Nos.
4,401,796 and 4,373,071, incorporated by reference herein).

Drug Screening Assays

By making available purified and recombinant H. pylori polypeptides, the present
invention provides assays which can be used to screen for drugs which are either agonists
or antagonists of the normal cellular function, in this case, of the subject H. pylori
polypeptides, or of their role in intracellular signaling. Such inhibitors or potentiators may
be useful as new therapeutic agents to combat H. pylori infections in humans. A variety of
assay formats will suffice and, in light of the present inventions, will be comprehended by
the skilled artisan.

In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of 
compounds surveyed in a given period of time. Assays which are performed in cell-free
systems, such as may be derived with purified or semi-purified proteins, are often preferred
as "primary" screens in that they can be generated to permit rapid development and
relatively easy detection of an alteration in a molecular target which is mediated by a test
compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test
compound can be generally ignored in the in vitro system, the assay instead being focused
primarily on the effect of the drug on the molecular target as may be manifest in an
alteration of binding affinity with other proteins or change in enzymatic properties of the
molecular target. Accordingly, in an exemplary screening assay of the present invention,
the compound of interest is contacted with an isolated and purified H. pylori polypeptide.

Screening assays may be constructed in vitro with a purified H. pylori enzyme such
that the action of the enzyme produces an easily detectable reaction product. The efficacy
of the compound can be assessed by generating dose response curves from data obtained
using various concentrations of the test compound. Moreover, a control assay can also be
performed to provide a baseline for comparison. Suitable products include those with
distinctive absorption, fluorescence, or chemi-luminescence properties, for example,
because detection may be easily automated. A variety of synthetic or naturally occurring
compounds may be tested in the assay to identify those which inhibit or potentiate the
activity of the H. pylori enzyme. Some of these active compounds may directly, or with
chemical alterations to promote membrane permeability or solubility, also inhibit or
potentiate the same enzymatic activity in whole, live H. pylori cells.
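
A dose response curve of the kind described above can be summarized in several ways. The sketch below is one simple possibility under stated assumptions: signals are normalized to the control assay, and the concentration giving 50 percent of control activity is estimated by log-linear interpolation between the two bracketing data points. The function names and the choice of a 50 percent summary value are assumptions for illustration and are not prescribed by the text.

```python
# Illustrative sketch for an inhibitor: names and the 50 percent summary
# value are assumptions, not part of the described assay.
import math


def percent_of_control(signal: float, control: float) -> float:
    """Express an assay signal as a percentage of the control (baseline) signal."""
    if control <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * signal / control


def ic50_by_interpolation(concentrations, activities):
    """Estimate the concentration giving 50 percent of control activity.

    `activities` are percent-of-control values measured at `concentrations`
    (all > 0). Returns None if the curve never crosses 50 percent.
    """
    if any(c <= 0 for c in concentrations):
        raise ValueError("concentrations must be positive for log interpolation")
    points = sorted(zip(concentrations, activities))
    for (c_lo, a_lo), (c_hi, a_hi) in zip(points, points[1:]):
        if (a_lo - 50.0) * (a_hi - 50.0) <= 0 and a_lo != a_hi:
            frac = (a_lo - 50.0) / (a_lo - a_hi)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


if __name__ == "__main__":
    concs = [1.0, 100.0]      # made-up concentrations, arbitrary units
    acts = [75.0, 25.0]       # made-up percent-of-control activities
    print(ic50_by_interpolation(concs, acts))
```

Fitting a sigmoidal model to the full curve would generally be more robust than two-point interpolation; the interpolation is used here only to keep the example short.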

Antibodies 

The invention also includes antibodies specifically reactive with the subject H.
pylori-like polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be
made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by
Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a
hamster or rabbit can be immunized with an immunogenic form of the peptide. Techniques
for conferring immunogenicity on a protein or peptide include conjugation to carriers or
other techniques well known in the art. An immunogenic portion of the subject H. pylori
polypeptide can be administered in the presence of adjuvant. The progress of immunization
can be monitored by detection of antibody titers in plasma or serum. Standard ELISA or
other immunoassays can be used with the immunogen as antigen to assess the levels of
antibodies. In a preferred embodiment, the subject antibodies are immunospecific for
antigenic determinants of the H. pylori polypeptides of the invention, e.g., antigenic
determinants of a polypeptide shown in SEQ ID NOs:115-228 or a closely related human or
non-human mammalian homolog (e.g., 90 percent homologous, more preferably at least
95 percent homologous). In yet a further preferred embodiment of the present invention,
the anti-H. pylori antibodies do not substantially cross react (i.e., react specifically) with a
protein which is, e.g., less than 80 percent homologous to a sequence shown in SEQ ID
NOs:115-228. By "not substantially cross react", it is meant that the antibody has a binding
affinity for a non-homologous protein which is less than 10 percent, more preferably less
than 5 percent, and even more preferably less than 1 percent, of the binding affinity for a
protein contained in SEQ ID NOs:115-228. In a most preferred embodiment, there is no
crossreactivity between bacterial and mammalian antigens.
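
The cross-reactivity thresholds above amount to a simple ratio test. The following sketch is illustrative only: the function names and tier labels are hypothetical, and it assumes both affinities are measured in the same units with larger values meaning tighter binding (for example, association constants).

```python
# Illustrative sketch: names and labels are hypothetical. Assumes affinities
# are in the same units and that larger values mean tighter binding.
def cross_reactivity_percent(off_target_affinity: float, target_affinity: float) -> float:
    """Off-target binding affinity as a percentage of the target affinity."""
    if target_affinity <= 0 or off_target_affinity < 0:
        raise ValueError("affinities must be non-negative and target affinity positive")
    return 100.0 * off_target_affinity / target_affinity


def preference_tier(percent: float) -> str:
    """Map a cross-reactivity percentage onto the thresholds stated in the text."""
    if percent < 1:
        return "even more preferred (<1 percent)"
    if percent < 5:
        return "more preferred (<5 percent)"
    if percent < 10:
        return "not substantially cross-reactive (<10 percent)"
    return "cross-reactive"


if __name__ == "__main__":
    # Made-up association constants for a target and a non-homologous protein.
    print(preference_tier(cross_reactivity_percent(2e6, 1e9)))
```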

The term antibody as used herein is intended to include fragments thereof which are 
also specifically reactive with H. pylori polypeptides. Antibodies can be fragmented using 
conventional techniques and the fragments screened for utility in the same manner as 
described above for whole antibodies. For example, F(ab')2 fragments can be generated by
treating antibody with pepsin. The resulting F(ab')2 fragment can be treated to reduce
disulfide bridges to produce Fab' fragments. The antibody of the present invention is
further intended to include bispecific and chimeric molecules having an anti-H. pylori
portion.

Both monoclonal and polyclonal antibodies (Ab) directed against H. pylori
polypeptides or H. pylori polypeptide variants, and antibody fragments such as Fab' and
F(ab')2, can be used to block the action of H. pylori polypeptide and allow the study of the
role of a particular H. pylori polypeptide of the present invention in aberrant or unwanted
intracellular signaling, as well as the normal cellular function of H. pylori, e.g., by
microinjection of anti-H. pylori polypeptide antibodies of the present invention.

Antibodies which specifically bind H. pylori epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of H. pylori antigens. Anti-H. pylori polypeptide antibodies can be
used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate H.
pylori levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise, the
ability to monitor H. pylori polypeptide levels in an individual can allow determination of
the efficacy of a given treatment regimen for an individual afflicted with such a disorder.
The level of an H. pylori polypeptide can be measured in cells found in bodily fluid, such
as in urine samples, or can be measured in tissue, such as produced by gastric biopsy.
Diagnostic assays using anti-H. pylori antibodies can include, for example, immunoassays
designed to aid in early diagnosis of H. pylori infections. The present invention can also be
used as a method of detecting antibodies contained in samples from individuals infected by
this bacterium using specific H. pylori antigens.

Another application of anti-H. pylori polypeptide antibodies of the present invention
is in the immunological screening of cDNA libraries constructed in expression vectors such
as λgt11, λgt18-23, λZAP, and λORF8. Messenger libraries of this type, having coding
sequences inserted in the correct reading frame and orientation, can produce fusion
proteins. For instance, λgt11 will produce fusion proteins whose amino termini consist of
β-galactosidase amino acid sequences and whose carboxy termini consist of a foreign
polypeptide. Antigenic epitopes of a subject H. pylori polypeptide can then be detected
with antibodies, as, for example, by reacting nitrocellulose filters lifted from infected plates
with anti-H. pylori polypeptide antibodies. Phage scored by this assay can then be
isolated from the infected plate. Thus, the presence of H. pylori gene homologs can be
detected and cloned from other species, and alternate isoforms (including splicing variants)
can be detected and cloned.

Vaccines 

The present invention also includes vaccine compositions for protection against
infection by H. pylori, a gram-negative spiral microaerophilic bacterium, or for treatment
of H. pylori infection. In one embodiment, the vaccine compositions contain
immunogenic surface proteins from H. pylori, or portions thereof, and a pharmaceutically
acceptable carrier. Nucleic acids within the scope of the invention are exemplified by the
nucleic acids shown in SEQ ID NOs:1-114, which encode the H. pylori surface proteins
shown in SEQ ID NOs:115-228. However, any nucleic acid encoding an immunogenic H.
pylori protein, or portion thereof, which is capable of expression in a cell, can be used in
the present invention. These vaccines can have therapeutic and prophylactic utilities.

Another aspect of the present invention provides vaccine compositions for
protection against infection by H. pylori or for treatment of H. pylori infection, which
contain a modified immunogenic H. pylori protein or portion thereof, and a
pharmaceutically acceptable carrier. It is possible to modify the structure of an H. pylori
protein or peptide for such purposes as increasing solubility or enhancing stability (e.g., shelf
life ex vivo and resistance to proteolytic degradation in vivo). A modified H. pylori protein
or peptide can be produced in which the amino acid sequence has been altered, such as by
amino acid substitution, deletion, or addition.

Another example of modification of an H. pylori peptide is substitution of cysteine
residues, preferably with alanine, serine, threonine, leucine or glutamic acid residues, to
minimize dimerization via disulfide linkages. In addition, amino acid side chains of
fragments of the protein of the invention can be chemically modified. Another
modification is cyclization of the peptide.

In order to enhance stability and/or reactivity, an H. pylori protein or peptide can be
modified to incorporate one or more polymorphisms in the amino acid sequence of the
protein resulting from any natural allelic variation. Additionally, D-amino acids, non-natural
amino acids, or non-amino acid analogs can be substituted or added to produce a
modified protein within the scope of this invention. Furthermore, an H. pylori protein can
be modified using polyethylene glycol (PEG) according to the method of A. Sehon and
co-workers (Wie et al., supra) to produce a protein conjugated with PEG. In addition, PEG
can be added during chemical synthesis of the protein. Other modifications of H. pylori
proteins include reduction/alkylation (Tarr, Methods of Protein Microcharacterization, J. E.
Silver ed., Humana Press, Clifton NJ 155-194 (1986)); acylation (Tarr, supra); chemical
coupling to an appropriate carrier (Mishell and Shiigi, eds. Selected Methods in Cellular
Immunology, WH Freeman, San Francisco, CA (1980); U.S. Patent 4,939,239); or mild
formalin treatment (Marsh, (1971) Int. Arch. of Allergy and Appl. Immunol. 41:199-215).

To facilitate purification and potentially increase solubility of an H. pylori protein 
or peptide, it is possible to add an amino acid fusion moiety to the peptide backbone. For 
example, hexa-histidine can be added to the protein for purification by immobilized metal 
ion affinity chromatography (Hochuli, E. et al., (1988) Bio/Technology 6:1321-1325). In
addition, to facilitate isolation of peptides free of irrelevant sequences, specific
endoprotease cleavage sites can be introduced between the sequences of the fusion moiety
and the peptide.

To potentially aid proper antigen processing of T cell epitopes within an H. pylori 
protein of the invention, canonical protease sensitive sites can be engineered between 
regions, each comprising at least one T cell epitope, via recombinant or synthetic methods.
For example, charged amino acid pairs, such as KK or RR, can be introduced between
regions within a protein or fragment during recombinant construction thereof. The
resulting peptide can be rendered sensitive to cleavage by cathepsin and/or other trypsin-like
enzymes which would generate portions of the protein containing one or more T cell
epitopes. In addition, such charged amino acid residues can result in an increase in the
solubility of the peptide.

Site-directed mutagenesis of a nucleic acid encoding an H. pylori protein can be
used to modify the structure of the peptide by methods known in the art. Such methods
may, among others, include polymerase chain reaction (PCR) with oligonucleotide primers
bearing one or more mutations (Ho et al., (1989) Gene 77:51-59) or total synthesis of
mutated genes (Hostomsky, Z. et al., (1989) Biochem. Biophys. Res. Comm. 161:1056-1063).
To enhance recombinant protein expression, the aforementioned methods can be
applied to change the codons present in the cDNA sequence of the invention to those
preferentially utilized by the host cell in which the recombinant protein is being expressed
(Wada et al., supra). An extensive discussion of mutagenesis protocols is provided in the
"Production of Fragments and Analogs" section herein.

Another aspect of the invention provides a vaccine composition for protection
against infection by H. pylori which contains an immunogenic fragment of an H. pylori
protein or portion thereof, and a pharmaceutically acceptable carrier. Preferred fragments
include peptides of at least about 10 amino acid residues in length, preferably about 10-20
amino acid residues in length, and more preferably about 12-16 amino acid residues in
length.

Immunogenic peptides of the invention can be obtained, for example, by screening
peptides recombinantly produced from the corresponding fragment of the nucleic acid
encoding the full-length H. pylori protein. In addition, fragments can be chemically
synthesized using techniques known in the art such as conventional Merrifield solid phase
f-Moc or t-Boc chemistry. For example, an H. pylori protein may be arbitrarily divided
into fragments of desired length with no overlap of the fragments, or preferably divided
into overlapping fragments of a desired length. The fragments can be produced
(recombinantly or by chemical synthesis) and tested to identify those peptides having the
ability to induce a T cell response, such as stimulation (proliferation, cytokine secretion).
An extensive discussion of peptide analogs and fragments is provided in the "Production of
Fragments and Analogs" section herein.
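
Dividing a protein into overlapping or non-overlapping fragments can be sketched as a sliding window over the amino acid sequence. The function name, the default fragment length of 15 residues (chosen from within the preferred 12-16 range), and the default step of 5 residues are assumptions for illustration; setting the step equal to the length gives fragments with no overlap.

```python
# Illustrative sketch: the function name and default values are assumptions.
def overlapping_fragments(protein: str, length: int = 15, step: int = 5):
    """Split a protein sequence into fragments of `length` residues.

    Successive fragments start `step` residues apart, so step < length gives
    overlapping fragments and step == length gives non-overlapping ones.
    A final fragment flush with the C-terminus is added when the regular
    windows would otherwise leave trailing residues uncovered.
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    if len(protein) <= length:
        return [protein]
    starts = list(range(0, len(protein) - length + 1, step))
    if starts[-1] + length < len(protein):
        starts.append(len(protein) - length)
    return [protein[s:s + length] for s in starts]


if __name__ == "__main__":
    example = "ABCDEFGHIJKLMNOPQRST"  # placeholder letters, not a real protein
    for fragment in overlapping_fragments(example, length=10, step=5):
        print(fragment)
```

Each fragment produced this way could then be synthesized or expressed and tested individually in the T cell assays described in the text.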

In one embodiment, immunogenic H. pylori fragments can be identified by the 
ability of the peptide to stimulate T cells. Peptides which stimulate T cells, as determined 
by, for example, T cell proliferation or cytokine secretion are defined herein as comprising 
at least one T cell epitope. T cell epitopes are believed to be involved in initiation and
perpetuation of the immune response to the protein allergen which is responsible for the
clinical symptoms of allergy. These T cell epitopes are thought to trigger early events at
the level of the T helper cell by binding to an appropriate HLA molecule on the surface of
an antigen presenting cell, thereby stimulating the T cell subpopulation with the relevant T
cell receptor for the epitope. These events lead to T cell proliferation, lymphokine
secretion, local inflammatory reactions, recruitment of additional immune cells to the site
of antigen/T cell interaction, and activation of the B cell cascade, leading to the production
of antibodies. One isotype of these antibodies, IgE, is fundamentally important to the
development of allergic symptoms and its production is influenced early in the cascade of
events at the level of the T helper cell, by the nature of the lymphokines secreted. A T cell
epitope is the basic element, or smallest unit of recognition by a T cell receptor, where the
epitope comprises amino acids essential to receptor recognition. Amino acid sequences
which mimic those of the T cell epitopes and which modify the allergic response to protein
allergens are within the scope of this invention.

In another embodiment, immunogenic components of the invention are identified 
through genomic vaccination. The basic protocol is based on the idea that expression 
libraries consisting of all or parts of a pathogen genome, e.g., an H. pylori genome, can
confer protection when used to genetically immunize a host. This expression library
immunization (ELI) is analogous to expression cloning and involves reducing a genomic
expression library of a pathogen, e.g., H. pylori, into plasmids that can act as genetic
vaccines. The plasmids can also be designed to encode genetic adjuvants which can
dramatically stimulate the humoral response. These genetic adjuvants can be introduced at
remote sites and act as well extracellularly as intracellularly.

This is a new approach to vaccine production that has many of the advantages of
live/attenuated pathogens but no risk of infection. An expression library of pathogen DNA
is used to immunize a host, thereby producing the effects of antigen presentation of a live
vaccine without the risk. For example, in the present invention, random fragments from the
H. pylori genome or from cosmid or plasmid clones, as well as PCR products from genes
identified by genomic sequencing, can be used to immunize a host. The feasibility of this
approach has been demonstrated with Mycoplasma pulmonis (Barry et al., Nature 377:632-635,
1995), where even partial expression libraries of Mycoplasma pulmonis, a natural
pathogen in rodents, provided protection against challenge from the pathogen.

ELI is a technique that allows for production of a non-infectious multipartite
vaccine, even when little is known about the pathogen's biology, because ELI uses the immune
system to screen candidate genes. Once isolated, these genes can be used as genetic
vaccines or for development of recombinant protein vaccines. Thus, ELI allows for
production of vaccines in a systematic, largely mechanized fashion.

Screening peptides for those which are immunogenic can be accomplished using
one or more of several different assays. For example, in vitro, peptide T cell stimulatory
activity is assayed by contacting a peptide known or suspected of being immunogenic with
an antigen presenting cell which presents appropriate MHC molecules in a T cell culture.
Presentation of an immunogenic H. pylori peptide in association with appropriate MHC
molecules, to T cells, in conjunction with the necessary costimulation, has the effect of
transmitting a signal to the T cell that induces the production of increased levels of
cytokines, particularly of interleukin-2 and interleukin-4. The culture supernatant can be
obtained and assayed for interleukin-2 or other known cytokines. For example, any one of
several conventional assays for interleukin-2 can be employed, such as the assay described
in Proc. Natl. Acad. Sci. USA, 86:1333 (1989), the pertinent portions of which are
incorporated herein by reference. A kit for an assay for the production of interferon is also
available from Genzyme Corporation (Cambridge, MA).

Alternatively, a common assay for T cell proliferation entails measuring tritiated
thymidine incorporation. The proliferation of T cells can be measured in vitro by
determining the amount of 3H-labeled thymidine incorporated into the replicating DNA of
cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell division
can be quantified.
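
Thymidine incorporation counts are often summarized as a ratio of peptide-stimulated to unstimulated cultures. The sketch below assumes that convention, which the text itself does not prescribe; the function name and the counts-per-minute values are hypothetical.

```python
# Illustrative sketch: the ratio summary, function name and example counts
# are assumptions and are not specified in the text.
from statistics import mean


def stimulation_index(stimulated_cpm, unstimulated_cpm) -> float:
    """Mean counts per minute with peptide divided by mean counts without."""
    if not stimulated_cpm or not unstimulated_cpm:
        raise ValueError("both replicate lists must be non-empty")
    background = mean(unstimulated_cpm)
    if background <= 0:
        raise ValueError("background incorporation must be positive")
    return mean(stimulated_cpm) / background


if __name__ == "__main__":
    with_peptide = [3000, 3200, 2800]      # made-up replicate counts
    without_peptide = [1000, 1000, 1000]   # made-up background counts
    print(stimulation_index(with_peptide, without_peptide))
```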

To produce modified proteins or immunogenic fragments by recombinant DNA
techniques, an expression vector containing a nucleic acid encoding all or a portion of an H.
pylori protein, operably linked to at least one regulatory sequence, can be used. Operably
linked is intended to mean that the nucleotide sequence is linked to a regulatory sequence in
a manner which allows expression of the nucleotide sequence. Regulatory sequences are
art-recognized and include promoters, enhancers and other expression control elements.
Such regulatory sequences are described in Goeddel, Gene Expression Technology:
Methods in Enzymology 185, Academic Press, San Diego, CA (1990). It should be
understood that the design of the expression vector may depend on such factors as the
choice of the host cell to be transformed and/or the type of protein desired to be expressed.
In one embodiment of the present invention, the expression vector includes nucleic acid,
preferably a DNA, encoding a modified H. pylori protein or immunogenic fragment having
all or a portion of the amino acid sequence. Such expression vectors can be used to
transfect cells to thereby produce proteins or peptides, including fusion proteins or peptides
encoded by nucleic acids as described herein.

Host cells suitable for transfection and recombinant production of H. pylori proteins
of the invention include any procaryotic or eucaryotic cell. For example, an H. pylori
protein or peptide may be expressed in bacterial cells such as E. coli, insect cells
(baculovirus), yeast, or mammalian cells such as Chinese hamster ovary cells (CHO).
Other suitable host cells can be found in Goeddel, (1990) supra or known to those skilled in
the art.

H. pylori proteins and fragments of the invention can also be chemically
synthesized, using techniques known in the art such as conventional Merrifield solid phase
f-Moc or t-Boc chemistry. The nucleic acids of the invention can also be chemically
synthesized using standard techniques. Various methods of chemically synthesizing
polydeoxynucleotides are known, including solid-phase synthesis which, like peptide
synthesis, has been fully automated in commercially available DNA synthesizers (see e.g.,
Itakura et al. U.S. Patent No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and
Itakura, U.S. Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein).

Vaccine compositions of the present invention containing DNA encoding
immunogenic protein from H. pylori, or containing modified protein or fragments, contain
both the DNA or protein and a pharmaceutically acceptable carrier. The term
"pharmaceutically acceptable carrier" refers to a carrier that does not cause an allergic
reaction or other untoward effect in patients to whom it is administered. Suitable
pharmaceutically acceptable carriers include, for example, one or more of water, saline,
phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well as combinations
thereof. Pharmaceutically acceptable carriers may further comprise minor amounts of
auxiliary substances such as wetting or emulsifying agents, preservatives or buffers, which
enhance the shelf life or effectiveness of the antibody. For vaccines of the invention
containing modified H. pylori protein or immunogenic protein fragments, the protein or
peptide must be coadministered with a suitable adjuvant.

It will be apparent to those of skill in the art that the therapeutically effective
amount of DNA or protein of this invention will depend, inter alia, upon the administration
schedule, the unit dose of antibody administered, whether the protein or DNA is
administered in combination with other therapeutic agents, the immune status and health of
the patient, and the therapeutic activity of the particular protein or DNA.

Vaccine compositions are conventionally administered parenterally, e.g., by
injection, either subcutaneously or intramuscularly. Methods for intramuscular
immunization are described by Wolff et al. (1990) Science 247: 1465-1468 and by Sedegah
et al. (1994) Immunology 91: 9866-9870. Other modes of administration include oral and
pulmonary formulations, suppositories, and transdermal applications. Oral immunization is
preferred over parenteral methods for inducing protection against infection by H. pylori.
Czinn et al. (1993) Vaccine 11: 637-642. Oral formulations include such normally
employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch,
magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like.

The vaccine compositions of the invention can include an adjuvant, including, but
not limited to, aluminum hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP);
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP);
N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-
glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE);
RIBI, which contains three components from bacteria: monophosphoryl lipid A, trehalose
dimycolate, and cell wall skeleton (MPL + TDM + CWS) in a 2% squalene/Tween 80




emulsion; and cholera toxin. Others which may be used are non-toxic derivatives of
cholera toxin, including its B subunit, and/or conjugates or genetically engineered fusions
of the H. pylori polypeptide with cholera toxin or its B subunit, procholeragenoid, fungal
polysaccharides, including schizophyllan, muramyl dipeptide, muramyl dipeptide
derivatives, phorbol esters, labile toxin of E. coli, non-H. pylori bacterial lysates, block
polymers or saponins.

Other suitable delivery methods include biodegradable microcapsules or immuno-
stimulating complexes (ISCOMs), cochleates, or liposomes, genetically engineered
attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like
particles, e.g., bluetongue. The amount of adjuvant employed will depend on the type of
adjuvant used. For example, when the mucosal adjuvant is cholera toxin, it is suitably used
in an amount of 5 µg to 50 µg, for example 10 µg to 35 µg. When used in the form of
microcapsules, the amount used will depend on the amount employed in the matrix of the
microcapsule to achieve the desired dosage. The determination of this amount is within the
skill of a person of ordinary skill in the art.

Carrier systems in humans may include enteric release capsules protecting the
antigen from the acidic environment of the stomach, and including H. pylori polypeptide in
an insoluble form as fusion proteins. Suitable carriers for the vaccines of the invention are
enteric coated capsules and polylactide-glycolide microspheres. Suitable diluents are 0.2 N
NaHCO3 and/or saline.

Vaccines of the invention can be administered as a primary prophylactic agent in
adults or in children, as a secondary prevention, after successful eradication of H. pylori in
an infected host, or as a therapeutic agent with the aim to induce an immune response in a
susceptible host to prevent infection by H. pylori. The vaccines of the invention are
administered in amounts readily determined by persons of ordinary skill in the art. Thus,
for adults a suitable dosage will be in the range of 10 µg to 10 g, preferably 10 µg to 100
mg, for example 50 µg to 50 mg. A suitable dosage for adults will also be in the range of 5
µg to 500 mg. Similar dosage ranges will be applicable for children. Those skilled in the
art will recognize that the optimal dose may be more or less, depending upon the patient's
body weight, disease, the route of administration, and other factors. Those skilled in the art
will also recognize that appropriate dosage levels can be obtained based on results with
known oral vaccines such as, for example, a vaccine based on an E. coli lysate (6 mg dose
daily up to total of 540 mg) and with an enterotoxigenic E. coli purified antigen (4 doses of
1 mg) (Schulman et al., J. Urol. 150:917-921 (1993); Boedecker et al., American
Gastroenterological Assoc. 999:A-222 (1993)). The number of doses will depend upon the
disease, the formulation, and efficacy data from clinical trials. Without intending any
limitation as to the course of treatment, the treatment can be administered over 3 to 8 doses
for a primary immunization schedule over 1 month (Boedeker, American
Gastroenterological Assoc. 888:A-222 (1993)).

In a preferred embodiment, a vaccine composition of the invention can be based on
a killed whole E. coli preparation with an immunogenic fragment of an H. pylori protein of
the invention expressed on its surface or it can be based on an E. coli lysate, wherein the
killed E. coli acts as a carrier or an adjuvant.

It will be apparent to those skilled in the art that some of the vaccine compositions
of the invention are useful only for preventing H. pylori infection, some are useful only for
treating H. pylori infection, and some are useful for both preventing and treating H. pylori
infection. In a preferred embodiment, the vaccine composition of the invention provides
protection against H. pylori infection by stimulating humoral and/or cell-mediated
immunity against H. pylori. It should be understood that amelioration of any of the
symptoms of H. pylori infection is a desirable clinical goal, including a lessening of the
dosage of medication used to treat H. pylori-caused disease, or an increase in the
production of antibodies in the serum or mucus of patients.

Determination of candidate protein antigens for antibody and vaccine development

The candidate protein antigens for vaccine development were selected on the basis
of the nucleotide sequence. First, all possible open reading frames (ORFs) greater than
50 nucleotides in all six reading frames were identified and translated into amino acid
sequences. Second, the identified ORFs were analyzed for homology to other known
exported or membrane proteins, and the ORFs were also analyzed using the discriminant
analysis described by Klein, et al. (Klein, P., Kanehisa, M., and DeLisi, C. (1985)
Biochimica et Biophysica Acta 815, 468-476) for predicting exported and membrane
proteins.
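
The first step can be illustrated with a short sketch. It is not the software used for the
analysis described here. It assumes that an ORF is any stop-free stretch in one of the six
reading frames (the text does not state whether a start codon was required), and the
names six_frame_orfs, translate and reverse_complement are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate whole codons; codons not in the table are reported as 'X'."""
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3))


def six_frame_orfs(dna, min_len=50):
    """List (strand, frame, start, nucleotides, protein) for every stop-free
    stretch longer than min_len nucleotides (assumed ORF definition)."""
    dna = dna.upper()
    found = []
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            start = frame
            # Index just past the last whole codon in this frame.
            last = frame + ((len(seq) - frame) // 3) * 3
            for pos in range(frame, last + 1, 3):
                at_end = pos == last
                is_stop = not at_end and CODON_TABLE.get(seq[pos:pos + 3]) == "*"
                if at_end or is_stop:
                    if pos - start > min_len:
                        orf = seq[start:pos]
                        found.append((strand, frame, start, orf, translate(orf)))
                    start = pos + 3
    return found
```

Each reported tuple gives the strand, the frame offset, the start index on that strand, the
nucleotide stretch and its translation, so the translated ORFs can be passed on to the
homology and discriminant analyses described in the text.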

Homology searches were performed using the BLAST algorithm contained in the
Wisconsin Sequence Analysis Package (Genetics Computer Group, University Research
Park, 575 Science Drive, Madison, WI 53711) to compare each predicted ORF amino acid
sequence with all sequences found in the current GenBank, SWISS-PROT and PIR
databases. BLAST searches for local alignments between the ORF and the databank
sequences and reports a probability score which indicates the probability of finding this
sequence by chance in the database. ORFs with significant homology (e.g. probabilities
better than 1 x 10^-6) to membrane or exported proteins represent likely protein antigens
for vaccine development. Possible functions are assigned to some of the H. pylori genes, as
indicated in Figure 560, based on sequence homology to genes cloned in other organisms.

Discriminant analysis (Klein, et al. supra) was used to examine the ORF amino acid
sequences using our own software. This algorithm uses the intrinsic information contained
in the ORF amino acid sequence and compares it to information derived from the properties
of known membrane and exported proteins. This comparison predicts which proteins will
be exported, membrane associated or cytoplasmic. ORF amino acid sequences identified as
exported or membrane associated by this algorithm are likely protein antigens for vaccine
development.

Infrequently it is not possible to distinguish between multiple possible nucleotides 
at a given position in the nucleic acid sequence. In those cases the ambiguities are denoted 
by an extended alphabet as follows: 

These are the official IUPAC-IUB single-letter base codes

Code    Base Description
G       Guanine
A       Adenine
T       Thymine
C       Cytosine
R       Purine               (A or G)
Y       Pyrimidine           (C or T or U)
M       Amino                (A or C)
K       Ketone               (G or T)
S       Strong interaction   (C or G)
W       Weak interaction     (A or T)
H       Not-G                (A or C or T)
B       Not-A                (C or G or T)
V       Not-T (not-U)        (A or C or G)
D       Not-C                (A or G or T)
N       Any                  (A or C or G or T)

The amino acid translations of this invention account for the ambiguity in the 
nucleic acid sequence by translating the ambiguous codon as the letter "X". In all cases, the 
permissible amino acid residues at a position are clear from an examination of the nucleic 
acid sequence based on the standard genetic code. 
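
The translation convention just described can be sketched as follows. This is an
illustrative reading of the rule, not the software used here: any codon containing an
ambiguity code is written as "X", and a hypothetical helper, permissible_residues, lists
the residues such a codon could encode under the standard genetic code. U is not handled;
sequences are assumed to be written as DNA.

```python
from itertools import product

# IUPAC-IUB codes from the table above, restricted to DNA bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT", "S": "CG", "W": "AT",
    "H": "ACT", "B": "CGT", "V": "ACG", "D": "AGT", "N": "ACGT",
}

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_ambiguous(dna):
    """Translate whole codons; any codon with an ambiguity code becomes 'X'."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3))


def permissible_residues(codon):
    """Set of residues (or '*') that a possibly ambiguous codon could encode."""
    expansions = product(*(IUPAC[base] for base in codon.upper()))
    return {CODON_TABLE["".join(bases)] for bases in expansions}
```

For example, the codon RAT is written as "X" in the translation, while
permissible_residues("RAT") shows that only asparagine (AAT) or aspartate (GAT) is
possible at that position.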

Production of Fragments and Analogs

The inventor has discovered novel gene products, e.g. bacterial surface gene
products, from the organism H. pylori. Once an example of this core structure has been
provided one skilled in the art can alter the disclosed structure (of H. pylori genes), e.g., by
producing fragments or analogs, and test the newly produced structures for activity.
Examples of prior art methods which allow the production and testing of fragments and
analogs are discussed below. These, or analogous methods can be used to make and screen
libraries of polypeptides, e.g., libraries of random peptides or libraries of fragments or
analogues of cellular proteins for the ability to bind H. pylori polypeptides. Such screens
are useful for discovery of inhibitors of H. pylori.

Generation of Fragments

Fragments of a protein can be produced in several ways, e.g., recombinantly, by
proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a
polypeptide can be generated by removing one or more nucleotides from one end (for a
terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes
the polypeptide. Expression of the mutagenized DNA produces polypeptide fragments.
Digestion with "end-nibbling" endonucleases can thus generate DNAs which encode an
array of fragments. DNAs which encode fragments of a protein can also be generated by
random shearing, restriction digestion or a combination of the above-discussed methods.

Fragments can also be chemically synthesized using techniques known in the art
such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example,
peptides of the present invention may be arbitrarily divided into fragments of desired length
with no overlap of the fragments, or divided into overlapping fragments of a desired length.
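
The division of a peptide into non-overlapping or overlapping fragments can be sketched
as below. The function name split_fragments and its parameters are hypothetical, and the
choice to keep a shorter final fragment is an assumption made for illustration.

```python
def split_fragments(peptide, length, overlap=0):
    """Divide a peptide into fragments of the given length.

    Consecutive fragments share `overlap` residues; overlap=0 gives
    non-overlapping fragments. The last fragment may be shorter.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    if not 0 <= overlap < length:
        raise ValueError("overlap must be at least 0 and less than length")
    step = length - overlap
    fragments = []
    for start in range(0, len(peptide), step):
        fragments.append(peptide[start:start + length])
        if start + length >= len(peptide):
            break  # the end of the peptide has been covered
    return fragments
```

With a ten-residue peptide and a fragment length of four, an overlap of zero gives three
fragments (the last with two residues), while an overlap of two gives four full-length
fragments, each offset by two residues from the previous one.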

Production of Altered DNA and Peptide Sequences: Random Methods

Amino acid sequence variants of a protein can be prepared by random mutagenesis
of DNA which encodes a protein or a particular domain or region of a protein. Useful
methods include PCR mutagenesis and saturation mutagenesis. A library of random amino
acid sequence variants can also be generated by the synthesis of a set of degenerate
oligonucleotide sequences. (Methods for screening proteins in a library of variants are
described elsewhere herein).

PCR Mutagenesis 

In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce random
mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15). This is
a very powerful and relatively rapid method of introducing random mutations. The DNA
region to be mutagenized is amplified using the polymerase chain reaction (PCR) under
conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerase, e.g., by
using a dGTP/dATP ratio of five and adding Mn2+ to the PCR reaction. The pool of
amplified DNA fragments is inserted into appropriate cloning vectors to provide random
mutant libraries.

Saturation Mutagenesis 

Saturation mutagenesis allows for the rapid introduction of a large number of single
base substitutions into cloned DNA fragments (Mayers et al., 1985, Science 229:242). This
technique includes generation of mutations, e.g., by chemical treatment or irradiation of
single-stranded DNA in vitro, and synthesis of a complementary DNA strand. The
mutation frequency can be modulated by modulating the severity of the treatment, and
essentially all possible base substitutions can be obtained. Because this procedure does not
involve a genetic selection for mutant fragments, both neutral substitutions and those
that alter function are obtained. The distribution of point mutations is not biased toward
conserved sequence elements.

Degenerate Oligonucleotides

A library of homologs can also be generated from a set of degenerate
oligonucleotide sequences. Chemical synthesis of a degenerate sequence can be carried
out in an automatic DNA synthesizer, and the synthetic genes then ligated into an
appropriate expression vector. The synthesis of degenerate oligonucleotides is known in
the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981)
Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton,
Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura
et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477). Such techniques
have been employed in the directed evolution of other proteins (see, for example, Scott et
al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al.
(1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378-6382; as well as U.S.
Patents Nos. 5,223,409, 5,198,346, and 5,096,815).
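
To make the size of such a library concrete, the sketch below counts and enumerates the
individual sequences represented by one degenerate oligonucleotide written in the
IUPAC-IUB codes. The function names and the NNK example codon are illustrative
assumptions and are not taken from the cited references.

```python
from itertools import product
from math import prod

# IUPAC-IUB single-letter codes, restricted to DNA bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT", "S": "CG", "W": "AT",
    "H": "ACT", "B": "CGT", "V": "ACG", "D": "AGT", "N": "ACGT",
}


def degeneracy(oligo):
    """Number of distinct sequences represented by a degenerate oligonucleotide."""
    return prod(len(IUPAC[base]) for base in oligo.upper())


def expand(oligo):
    """List every unambiguous sequence represented by a degenerate oligonucleotide."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in oligo.upper()))]
```

Because the count grows multiplicatively with each degenerate position, degeneracy is the
practical call for long oligonucleotides; expand is only useful for short stretches.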

Production of Altered DNA and Peptide Sequences: Methods for Directed Mutagenesis

Non-random, or directed, mutagenesis techniques can be used to provide specific
sequences or mutations in specific regions. These techniques can be used to create variants
which include, e.g., deletions, insertions, or substitutions, of residues of the known amino
acid sequence of a protein. The sites for mutation can be modified individually or in series,
e.g., by (1) substituting first with conserved amino acids and then with more radical choices
depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of
the same or a different class adjacent to the located site, or combinations of options 1-3.

Alanine Scanning Mutagenesis 

Alanine scanning mutagenesis is a useful method for identification of certain
residues or regions of the desired protein that are preferred locations or domains for
mutagenesis (Cunningham and Wells, Science 244:1081-1085, 1989). In alanine scanning,
a residue or group of target residues is identified (e.g., charged residues such as Arg, Asp,
His, Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most
preferably alanine or polyalanine). Replacement of an amino acid can affect the interaction
of the amino acids with the surrounding aqueous environment in or outside the cell. Those
domains demonstrating functional sensitivity to the substitutions are then refined by
introducing further or other variants at or for the sites of substitution. Thus, while the site
for introducing an amino acid sequence variation is predetermined, the nature of the
mutation per se need not be predetermined. For example, to optimize the performance of a
mutation at a given site, alanine scanning or random mutagenesis may be conducted at the
target codon or region and the expressed desired protein subunit variants are screened for
the optimal combination of desired activity.
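
A minimal sketch of how the set of single alanine variants might be enumerated is given
below. The function name alanine_scan and the default target set (the charged residues
named above, in one-letter code) are assumptions for illustration; the sketch works at the
protein level and does not design the corresponding codon changes.

```python
def alanine_scan(protein, targets="RDHKE"):
    """Return (label, variant) pairs, one per targeted residue replaced by Ala.

    Labels use the conventional form <original residue><position>A, with
    positions counted from 1.
    """
    variants = []
    for index, residue in enumerate(protein):
        if residue in targets and residue != "A":
            label = f"{residue}{index + 1}A"
            variants.append((label, protein[:index] + "A" + protein[index + 1:]))
    return variants
```

Each variant differs from the parent sequence at exactly one position, so a loss of activity
in a given variant can be attributed to the residue named in its label.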

Oligonucleotide-Mediated Mutagenesis 

Oligonucleotide-mediated mutagenesis is a useful method for preparing
substitution, deletion, and insertion variants of DNA, see, e.g., Adelman et al., (DNA 2:183,
1983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a
mutation to a DNA template, where the template is the single-stranded form of a plasmid or
bacteriophage containing the unaltered or native DNA sequence of the desired protein.
After hybridization, a DNA polymerase is used to synthesize an entire second
complementary strand of the template that will thus incorporate the oligonucleotide primer,
and will code for the selected alteration in the desired protein DNA. Generally,
oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide
will have 12 to 15 nucleotides that are completely complementary to the template on either
side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will
hybridize properly to the single-stranded DNA template molecule. The oligonucleotides
are readily synthesized using techniques known in the art such as that described by Crea et
al. (Proc. Natl. Acad. Sci. USA, 75: 5765 [1978]).
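
The flanking rule above (12 to 15 matched nucleotides on either side of the change) can be
sketched as follows. The function names are hypothetical. The sketch assumes the sequence
is given in the sense of the desired (mutated) strand, so the oligonucleotide that actually
anneals to a single-stranded template of that sense would be the reverse complement of the
returned sequence.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def mutagenic_oligo(template, position, new_bases, replaced=None, flank=15):
    """Build an oligonucleotide carrying `new_bases` at 0-based `position`.

    `replaced` is the number of template bases being replaced (defaults to
    len(new_bases), i.e. a substitution; 0 gives an insertion). `flank`
    unchanged template bases are kept on each side of the change.
    """
    if replaced is None:
        replaced = len(new_bases)
    left = position - flank
    right = position + replaced + flank
    if left < 0 or right > len(template):
        raise ValueError("not enough flanking sequence around the mutation site")
    return template[left:position] + new_bases + template[position + replaced:right]
```

A three-base substitution with flank=12 gives a 27-nucleotide sequence, consistent with the
minimum length of 25 nucleotides stated above.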

Cassette Mutagenesis 

Another method for preparing variants, cassette mutagenesis, is based on the
technique described by Wells et al. (Gene, 34:315 [1985]). The starting material is a
plasmid (or other vector) which includes the protein subunit DNA to be mutated. The
codon(s) in the protein subunit DNA to be mutated are identified. There must be a unique
restriction endonuclease site on each side of the identified mutation site(s). If no such
restriction sites exist, they may be generated using the above-described oligonucleotide-
mediated mutagenesis method to introduce them at appropriate locations in the desired
protein subunit DNA. After the restriction sites have been introduced into the plasmid, the
plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the
sequence of the DNA between the restriction sites but containing the desired mutation(s) is
synthesized using standard procedures. The two strands are synthesized separately and then
hybridized together using standard techniques. This double-stranded oligonucleotide is
referred to as the cassette. This cassette is designed to have 3' and 5' ends that are
compatible with the ends of the linearized plasmid, such that it can be directly ligated to the
plasmid. This plasmid now contains the mutated desired protein subunit DNA sequence.

Combinatorial Mutagenesis

Combinatorial mutagenesis can also be used to generate mutants (Ladner et al., WO
88/06630). In this method, the amino acid sequences for a group of homologs or other
related proteins are aligned, preferably to promote the highest homology possible. All of
the amino acids which appear at a given position of the aligned sequences can be selected
to create a degenerate set of combinatorial sequences. The variegated library of variants is
generated by combinatorial mutagenesis at the nucleic acid level, and is encoded by a
variegated gene library. For example, a mixture of synthetic oligonucleotides can be
enzymatically ligated into gene sequences such that the degenerate set of potential
sequences is expressible as individual peptides, or alternatively, as a set of larger fusion
proteins containing the set of degenerate sequences.
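
The construction of the degenerate set from aligned sequences can be sketched at the
amino acid level as below. The sketch assumes a gap-free alignment of equal-length
sequences, and the function names are hypothetical; it does not design the variegated
oligonucleotides themselves.

```python
from itertools import product
from math import prod


def position_sets(aligned):
    """For each alignment column, list the residues observed at that position."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return [sorted(set(column)) for column in zip(*aligned)]


def library_size(sets):
    """Number of distinct sequences in the combinatorial set."""
    return prod(len(residues) for residues in sets)


def enumerate_library(sets):
    """List every member of the combinatorial set (small sets only)."""
    return ["".join(choice) for choice in product(*sets)]
```

Note that the combinatorial set usually contains sequences that are not among the aligned
homologs, because the residues observed at different positions are recombined freely.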

Primary High-Through-Put Methods for Screening Libraries of Peptide Fragments or 
Homologs 

Various techniques are known in the art for screening generated mutant gene
products. Techniques for screening large gene libraries often include cloning the gene
library into replicable expression vectors, transforming appropriate cells with the resulting
library of vectors, and expressing the genes under conditions in which detection of a
desired activity, e.g., in this case, binding to H. pylori polypeptide or an interacting protein,
facilitates relatively easy isolation of the vector encoding the gene whose product was
detected. Each of the techniques described below is amenable to high through-put analysis
for screening large numbers of sequences created, e.g., by random mutagenesis techniques.

Two Hybrid Systems 

Two hybrid assays such as the system described above (as with the other screening
methods described herein), can be used to identify polypeptides, e.g., fragments or analogs
of a naturally-occurring H. pylori polypeptide, e.g., of cellular proteins, or of randomly
generated polypeptides which bind to an H. pylori protein. (The H. pylori domain is used
as the bait protein and the library of variants is expressed as fish fusion proteins.) In an
analogous fashion, a two hybrid assay (as with the other screening methods described
herein), can be used to find polypeptides which bind an H. pylori polypeptide.

Display Libraries 

In one approach to screening assays, the candidate peptides are displayed on the
surface of a cell or viral particle, and the ability of particular cells or viral particles to bind
an appropriate receptor protein via the displayed product is detected in a "panning assay".
For example, the gene library can be cloned into the gene for a surface membrane protein of
a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO
88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS


18:136-140). In a similar fashion, a detectably labeled ligand can be used to score for
potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can
be used to detect homologs which retain ligand-binding activity. The use of fluorescently
labeled ligands allows cells to be visually inspected and separated under a fluorescence
microscope, or, where the morphology of the cell permits, to be separated by a
fluorescence-activated cell sorter.

A gene library can be expressed as a fusion protein on the surface of a viral particle.
For instance, in the filamentous phage system, foreign peptide sequences can be expressed
on the surface of infectious phage, thereby conferring two significant benefits. First, since
these phage can be applied to affinity matrices at concentrations well over 10^13 phage per
milliliter, a large number of phage can be screened at one time. Second, since each
infectious phage displays a gene product on its surface, if a particular phage is recovered
from an affinity matrix in low yield, the phage can be amplified by another round of
infection. The group of almost identical E. coli filamentous phages M13, fd, and f1 are
most often used in phage display libraries. Either of the phage gIII or gVIII coat proteins
can be used to generate fusion proteins without disrupting the ultimate packaging of the
viral particle. Foreign epitopes can be expressed at the NH2-terminal end of pIII and phage
bearing such epitopes recovered from a large excess of phage lacking this epitope (Ladner
et al. PCT publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks
et al. (1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J. 12:725-734;
Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461).

A common approach uses the maltose receptor of E. coli (the outer membrane
protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO 5, 3029-3037).
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce
peptides fused into one of the extracellular loops of the protein. These peptides are
available for binding to ligands, e.g., to antibodies, and can elicit an immune response when
the cells are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al.
(1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 37-45), and
PAL (Fuchs et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial surface
structures have served as vehicles for peptide display. Peptides can be fused to pilin, a
protein which polymerizes to form the pilus, a conduit for interbacterial exchange of genetic
information (Thiry et al. (1989) Appl. Environ. Microbiol. 55, 984-993). Because of its
role in interacting with other cells, the pilus provides a useful support for the presentation
of peptides to the extracellular environment. Another large surface structure used for
peptide display is the bacterial motile organ, the flagellum. Fusion of peptides to the
subunit protein flagellin offers a dense array of many peptide copies on the host cells
(Kuwajima et al. (1988) Bio/Tech. 6, 1080-1083). Surface proteins of other bacterial
species have also served as peptide fusion partners. Examples include the Staphylococcus




protein A and the outer membrane protease IgA of Neisseria (Hansson et al. (1992) J.
Bacteriol. 174, 4239-4245 and Klauser et al. (1990) EMBO J. 9, 1991-1999).

In the filamentous phage systems and the LamB system described above, the
physical link between the peptide and its encoding DNA occurs by the containment of the
DNA within a particle (cell or phage) that carries the peptide on its surface. Capturing the
peptide captures the particle and the DNA within. An alternative scheme uses the DNA-
binding protein LacI to form a link between peptide and DNA (Cull et al. (1992) PNAS
USA 89:1865-1869). This system uses a plasmid containing the LacI gene with an
oligonucleotide cloning site at its 3'-end. Under the controlled induction by arabinose, a
LacI-peptide fusion protein is produced. This fusion retains the natural ability of LacI to
bind to a short DNA sequence known as the LacO operator (LacO). By installing two copies
of LacO on the expression plasmid, the LacI-peptide fusion binds tightly to the plasmid that
encoded it. Because the plasmids in each cell contain only a single oligonucleotide
sequence and each cell expresses only a single peptide sequence, the peptides become
specifically and stably associated with the DNA sequence that directed their synthesis. The
cells of the library are gently lysed and the peptide-DNA complexes are exposed to a matrix
of immobilized receptor to recover the complexes containing active peptides. The
associated plasmid DNA is then reintroduced into cells for amplification and DNA
sequencing to determine the identity of the peptide ligands. As a demonstration of the
practical utility of the method, a large random library of dodecapeptides was made and
selected on a monoclonal antibody raised against the opioid peptide dynorphin B. A cohort
of peptides was recovered, all related by a consensus sequence corresponding to a six-
residue portion of dynorphin B. (Cull et al. (1992) Proc. Natl. Acad. Sci. U.S.A. 89:1865-1869)

This scheme, sometimes referred to as peptides-on-plasmids, differs in two
important ways from the phage display methods. First, the peptides are attached to the C-
terminus of the fusion protein, resulting in the display of the library members as peptides
having free carboxy termini. Both of the filamentous phage coat proteins, pIII and pVIII,
are anchored to the phage through their C-termini, and the guest peptides are placed into the
outward-extending N-terminal domains. In some designs, the phage-displayed peptides are
presented right at the amino terminus of the fusion protein. (Cwirla, et al. (1990) Proc.
Natl. Acad. Sci. USA 87, 6378-6382) A second difference is the set of biological biases
affecting the population of peptides actually present in the libraries. The LacI fusion
molecules are confined to the cytoplasm of the host cells. The phage coat fusions are
exposed briefly to the cytoplasm during translation but are rapidly secreted through the
inner membrane into the periplasmic compartment, remaining anchored in the membrane
by their C-terminal hydrophobic domains, with the N-termini, containing the peptides,
protruding into the periplasm while awaiting assembly into phage particles. The peptides
in the LacI and phage libraries may differ significantly as a result of their exposure to
different proteolytic activities. The phage coat proteins require transport across the inner
membrane and signal peptidase processing as a prelude to incorporation into phage.
Certain peptides exert a deleterious effect on these processes and are underrepresented in
the libraries (Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251). These particular biases
are not a factor in the LacI display system.

The number of small peptides available in recombinant random libraries is
enormous. Libraries of 10^7 to 10^9 independent clones are routinely prepared. Libraries as
large as 10^11 recombinants have been created, but this size approaches the practical limit
for clone libraries. This limitation in library size occurs at the step of transforming the
DNA containing randomized segments into the host bacterial cells. To circumvent this
limitation, an in vitro system based on the display of nascent peptides in polysome
complexes has recently been developed. This display library method has the potential of
producing libraries 3-6 orders of magnitude larger than the currently available
phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries,
expression of the peptides, and screening are done in an entirely cell-free format.
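
To put these library sizes in perspective, the following minimal sketch compares the number of possible peptide sequences of a given length with the number of library members. It assumes the 20 standard amino acids, all equally likely, and ignores codon bias and stop codons; these simplifications and the function names are illustrative assumptions, not part of the methods described here.

```python
# Sequence space versus library size for random peptide libraries.
# Assumption: 20 standard amino acids, all equally likely (no codon bias,
# no stop codons). Function names are illustrative only.

def sequence_space(length: int, alphabet: int = 20) -> int:
    """Number of distinct peptides of the given length."""
    return alphabet ** length


def max_fraction_sampled(library_size: float, length: int) -> float:
    """Upper bound on the fraction of sequence space a library can contain."""
    return min(1.0, library_size / sequence_space(length))


if __name__ == "__main__":
    examples = [
        ("dodecapeptides, 10^9 clones", 12, 1e9),
        ("dodecapeptides, 10^11 clones", 12, 1e11),
        ("decapeptides, 10^12 molecules", 10, 1e12),
    ]
    for name, length, size in examples:
        space = sequence_space(length)
        fraction = max_fraction_sampled(size, length)
        print(f"{name}: space = {space:.3e}, max fraction = {fraction:.3e}")
```

Even the largest libraries sample only part of the decapeptide or dodecapeptide sequence space, which is one reason library size matters.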

In one application of this method (Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251),
a molecular DNA library encoding 10^12 decapeptides was constructed and the
library expressed in an E. coli S30 in vitro coupled transcription/translation system.
Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation of a
substantial proportion of the RNA in polysomes and yielding complexes containing nascent
peptides still linked to their encoding RNA. The polysomes are sufficiently robust to be
affinity purified on immobilized receptors in much the same way as the more conventional
recombinant peptide display libraries are screened. RNA from the bound complexes is
recovered, converted to cDNA, and amplified by PCR to produce a template for the next
round of synthesis and screening. The polysome display method can be coupled to the
phage display system. Following several rounds of screening, cDNA from the enriched
pool of polysomes was cloned into a phagemid vector. This vector serves as both a peptide
expression vector, displaying peptides fused to the coat proteins, and as a DNA sequencing
vector for peptide identification. By expressing the polysome-derived peptides on phage,
one can either continue the affinity selection procedure in this format or assay the peptides
on individual clones for binding activity in a phage ELISA, or for binding specificity in a
competition phage ELISA (Barrett et al. (1992) Anal. Biochem. 204:357-364). To identify
the sequences of the active peptides, one sequences the DNA produced by the phagemid
host.



Secondary Screens

The high throughput assays described above can be followed by secondary screens
in order to identify further biological activities which will, e.g., allow one skilled in the art
to differentiate agonists from antagonists. The type of secondary screen used will depend
on the desired activity that needs to be tested. For example, an assay can be developed in
which the ability to inhibit an interaction between a protein of interest and its respective
ligand can be used to identify antagonists from a group of peptide fragments isolated
through one of the primary screens described above.

Therefore, methods for generating fragments and analogs and testing them for
activity are known in the art. Once the core sequence of interest is identified, it is routine
for one skilled in the art to obtain analogs and fragments.

Peptide Mimetics

The invention also provides for reduction of the protein binding domains of the
subject H. pylori-like family polypeptides, e.g., an H. pylori polypeptide, to generate
mimetics, e.g. peptide or non-peptide agents. The peptide mimetics are able to disrupt
binding of an H. pylori polypeptide to its counter ligand, e.g., in the case of an H. pylori
polypeptide binding to a naturally occurring ligand. The critical residues of a subject H. pylori
polypeptide which are involved in molecular recognition of a polypeptide can be
determined and used to generate H. pylori-derived peptidomimetics which competitively or
noncompetitively inhibit binding of the H. pylori polypeptide with an interacting
polypeptide (see, for example, "Peptide inhibitors of human papillomavirus protein binding
to retinoblastoma gene protein" European patent applications EP-412,762A and
EP-B31,080A). For example, scanning mutagenesis can be used to map the amino acid
residues of a particular H. pylori polypeptide involved in binding an interacting
polypeptide; peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can then be
generated which mimic those residues in binding to an interacting polypeptide, and which
therefore can inhibit binding of an H. pylori polypeptide to an interacting polypeptide and
thereby interfere with the function of the H. pylori polypeptide. For instance,
non-hydrolyzable peptide analogs of such residues can be generated using benzodiazepine (e.g.,
see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM
Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides:
Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988),
substituted gamma lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R.
Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene
pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in
Peptides: Structure and Function (Proceedings of the 9th American Peptide Symposium)
Pierce Chemical Co. Rockland, IL, 1985), β-turn dipeptide cores (Nagai et al. (1985)
Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and
β-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and Dann et
al. (1986) Biochem Biophys Res Commun 134:71).

Kits

The nucleic acid, peptides and antibodies of the present invention can be combined
with other reagents and articles to form kits. Kits for diagnostic purposes typically
comprise the nucleic acid, peptides or antibodies in vials or other suitable vessels. Kits
typically comprise other reagents for performing hybridization reactions, polymerase chain
reactions (PCR), or for reconstitution of lyophilized components, such as aqueous media,
salts, buffers, and the like. Kits may also comprise reagents for sample processing such as
detergents, chaotropic salts and the like. Kits may also comprise immobilization means
such as particles, supports, wells, dipsticks and the like. Kits may also comprise labeling
means such as dyes, developing reagents, radioisotopes, fluorescent agents, luminescent or
chemiluminescent agents, enzymes, intercalating agents and the like. With the nucleic acid
and amino acid sequence information provided herein, individuals skilled in the art can readily
assemble kits to serve their particular purpose.

Exemplification 

I. Cloning and Sequencing of H. pylori DNA

H. pylori chromosomal DNA was isolated according to a basic DNA protocol
outlined in Schleif R.F. and Wensink P.C., Practical Methods in Molecular Biology, p. 98,
Springer-Verlag, NY, 1981, with minor modifications. Briefly, cells were pelleted and
resuspended in TE (10 mM Tris, 1 mM EDTA, pH 7.6), and GES lysis buffer (5.1 M
guanidinium thiocyanate, 0.1 M EDTA, pH 8.0, 0.5% N-laurylsarcosine) was added. The
suspension was chilled and ammonium acetate (NH4Ac) was added to a final concentration
of 2.0 M. DNA was extracted, first with chloroform, then with phenol-chloroform, and
reextracted with chloroform. DNA was precipitated with isopropanol, washed twice with
70% EtOH, dried and resuspended in TE.

Following isolation, whole genomic H. pylori DNA was nebulized (Bodenteich et
al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.), Academic Press, 1994) to
a median size of 2000 bp. After nebulization, the DNA was concentrated and separated on
a standard 1% agarose gel. Several fractions, corresponding to approximate sizes 900-1300
bp, 1300-1700 bp, 1700-2200 bp, 2200-2700 bp, were excised from the gel and purified by
the GeneClean procedure (Bio101, Inc.).

The purified DNA fragments were then blunt-ended using T4 DNA polymerase.
The healed DNA was then ligated to unique BstXI-linker adapters in 100-1000 fold molar
excess. These linkers are complementary to the BstXI-cut pMPX vectors, while the
overhang is not self-complementary. Therefore, the linkers will not concatemerize nor will
the cut vector religate to itself easily. The linker-adapted inserts were separated from the
unincorporated linkers on a 1% agarose gel and purified using GeneClean. The
linker-adapted inserts were then ligated to each of the 20 pMPX vectors to construct a series of
"shotgun" subclone libraries. The vectors contain an out-of-frame lacZ gene at the cloning
site which becomes in-frame in the event that an adapter-dimer is cloned, allowing these to
be avoided by their blue color.
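
The reasoning behind the non-self-complementary overhang can be checked with a few lines of code: a single-stranded overhang can anneal to another copy of itself only if it equals its own reverse complement. The sketch below illustrates that test. The example overhangs are hypothetical; the actual BstXI adapter sequences are not given in this description.

```python
# Check whether a single-stranded overhang can pair with a copy of itself.
# The example overhangs used with these functions are hypothetical; the
# source does not list the actual BstXI adapter sequences.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_self_complementary(overhang: str) -> bool:
    """True if two copies of the overhang could anneal (concatemerize)."""
    return overhang.upper() == reverse_complement(overhang)


def can_anneal(overhang_a: str, overhang_b: str) -> bool:
    """True if two overhangs, both written 5' to 3', are complementary."""
    return overhang_a.upper() == reverse_complement(overhang_b)


if __name__ == "__main__":
    print(is_self_complementary("AATT"))   # palindromic overhang
    print(is_self_complementary("CACA"))   # non-palindromic overhang
    print(can_anneal("CACA", "TGTG"))      # adapter paired with matching vector end
```

A palindromic overhang such as AATT pairs with itself, whereas a non-palindromic overhang pairs only with its designed partner, which is the property relied on above for linkers and cut vector.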

All subsequent steps were based on the multiplex DNA sequencing protocols
outlined in Church G.M. and Kieffer-Higgins S., Science 240:185-188, 1988. Only major
modifications to the protocols are highlighted. Briefly, each of the 20 vectors was then
transformed into DH5α competent cells (Gibco/BRL, DH5α transformation protocol). The
libraries were assessed by plating onto antibiotic plates containing ampicillin, methicillin
and IPTG/Xgal. The plates were incubated overnight at 37°C. Successful transformants
were then used for plating of clones and pooling into the multiplex pools. The clones were
picked and pooled into 40 ml growth medium cultures. The cultures were grown overnight
at 37°C. DNA was purified using the Qiagen Midi-prep kits and Tip-100 columns
(Qiagen, Inc.). In this manner, 100 µg of DNA was obtained per pool. Fifteen 96-well plates of
DNA were generated to obtain a 5-10 fold sequence redundancy with 250-300 base average
read-lengths.

These purified DNA samples were then sequenced using the multiplex DNA
sequencing based on chemical degradation methods (Church G.M. and Kieffer-Higgins S.,
Science 240:185-188, 1988) or by Sequitherm (Epicentre Technologies) dideoxy
sequencing protocols. The sequencing reactions were electrophoresed and transferred onto
nylon membranes by direct transfer electrophoresis from 40 cm gels (Richterich P. and
Church G.M., Methods in Enzymology 218:187-222, 1993) or by electroblotting (Church,
supra). 24 samples were run per gel. 45 successful membranes were produced by chemical
sequencing and 8 were produced by dideoxy sequencing. The DNA was covalently bound
to the membranes by exposure to ultraviolet light, and hybridized with labeled
oligonucleotides complementary to tag sequences on the vectors (Church, supra). The
membranes were washed to rinse off non-specifically bound probe, and exposed to X-ray
film to visualize individual sequence ladders. After autoradiography, the hybridized probe
was removed by incubation at 65°C, and the hybridization cycle repeated with another tag
sequence until the membrane had been probed 38 times for chemical sequencing
membranes and 10 times for the dideoxy sequencing membranes. Thus, each gel produced
a large number of films, each containing new sequencing information. Whenever a new
blot was processed, it was initially probed for an internal standard sequence added to each
of the pools.
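
The membrane, sample and probing counts given above imply an upper bound on the number of sequence ladders, which can be worked out as follows. The sketch assumes that every sample on every probing yields one readable ladder, and it uses an approximate H. pylori genome size of 1.65 Mb that is an assumption for illustration only and is not stated in this description.

```python
# Sequence ladders obtained by repeatedly probing multiplex membranes.
# Membrane, sample and probing counts come from the text; the genome size
# is an assumption used only to illustrate the redundancy estimate.

def ladders(membranes: int, samples_per_gel: int, probings: int) -> int:
    """Upper bound on ladders: one ladder per sample per probing."""
    return membranes * samples_per_gel * probings


chemical = ladders(membranes=45, samples_per_gel=24, probings=38)
dideoxy = ladders(membranes=8, samples_per_gel=24, probings=10)
total = chemical + dideoxy

ASSUMED_GENOME_BP = 1_650_000  # assumption: approximate H. pylori genome size


def redundancy(n_reads: int, read_length_bp: int, genome_bp: int) -> float:
    """Average fold coverage = total bases read / genome length."""
    return n_reads * read_length_bp / genome_bp


if __name__ == "__main__":
    print("chemical:", chemical, "dideoxy:", dideoxy, "total:", total)
    for read_len in (250, 300):
        fold = redundancy(total, read_len, ASSUMED_GENOME_BP)
        print(f"{read_len} bp reads: about {fold:.1f}-fold")
```

Under these assumptions the estimate falls at roughly 6.5- to 7.8-fold, within the 5-10 fold redundancy range stated earlier; failed ladders would lower the real figure.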

Digital images of the films were generated using a laser-scanning densitometer
(Molecular Dynamics, Sunnyvale, CA). The digitized images were processed on computer
workstations (VaxStation 4000's) using the program REPLICA™ (Church et al.,
Automated DNA Sequencing and Analysis (J.C. Venter, ed.), Academic Press, 1994).
Image processing included lane straightening, contrast adjustment to smooth out intensity
differences, and resolution enhancement by iterative Gaussian deconvolution. The
sequences were then automatically picked in REPLICA™ and displayed for interactive
proofreading before being stored in a project database. The proofreading was accomplished
by a quick visual scan of the film image followed by mouse clicks on the bands of the
displayed image to modify the base calls. For typical sequences derived by chemical
sequencing, the error rate of the REPLICA™ base calling software was 2-5%, with most
errors occurring near the end of a sequence read. Many of the sequence errors could be
detected and corrected because multiple sequence reads covering the same portion of the
genomic DNA provide adequate sequence redundancy for editing. Each sequence
automatically received an identifying number derived from the microtiter plate and probe
information and the lane set number (corresponding to microtiter plate columns). This number
serves as a permanent identifier of the sequence, so it is always possible to identify the origin
of any particular sequence without recourse to a specialized database.
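
The way redundant reads allow base-calling errors to be detected and corrected can be illustrated with a simple majority-vote consensus. This is only a sketch under the assumption that the reads are already aligned to one another; it is not the editing procedure used by REPLICA™, and the example reads are hypothetical.

```python
# Majority-vote consensus over pre-aligned reads (illustrative only).
# '-' marks positions that a read does not cover.

from collections import Counter


def consensus(aligned_reads: list[str], gap: str = "-") -> str:
    """Return the most frequent base in each column; 'N' if no read covers it."""
    if not aligned_reads:
        return ""
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to the same length")
    called = []
    for column in zip(*aligned_reads):
        counts = Counter(base for base in column if base != gap)
        called.append(counts.most_common(1)[0][0] if counts else "N")
    return "".join(called)


if __name__ == "__main__":
    reads = [
        "ACGTACGT--",
        "ACGAACGTAC",  # hypothetical miscall at the fourth base
        "--GTACGTAC",
    ]
    print(consensus(reads))
```

With three reads covering the miscalled position, the two correct calls outvote the error. Ties are resolved here by first occurrence, which a real editor would instead flag for manual proofreading.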

Routine assembly of H. pylori sequences was done using the program FALCON
(Church et al., Automated DNA Sequencing and Analysis (J.C. Venter, ed.),
Academic Press, 1994). This program has proven to be fast and reliable for most
sequences. The assembled contigs were displayed using a modified version of
GelAssemble, developed by the Genetics Computer Group (GCG) (Devereux et al., Nucleic
Acids Res. 12:387-95, 1984) that interacts with REPLICA™. This provided for an
integrated editor that allows multiple sequence gel images to be instantaneously called up
from the REPLICA™ database and displayed to allow rapid scanning of contigs and
proofreading of gel traces where discrepancies occurred between different sequence reads
in the assembly.

II. Identification, cloning and expression of recombinant H. pylori DNA sequences

To facilitate the cloning, expression and purification of membrane and secreted
proteins from H. pylori, a powerful gene expression system, the pET System (Novagen), for
cloning and expression of recombinant proteins in E. coli, was selected. Also, a DNA
sequence encoding a peptide tag, the His-Tag, was fused to the 3' end of DNA sequences of
interest in order to facilitate purification of the recombinant protein products. The 3' end
was selected for fusion in order to avoid alteration of any 5' terminal signal sequence. The
exception to the above was ppiB, a gene cloned for use as a control in the expression
studies. In this study, the sequence for H. pylori ppiB contains a DNA sequence encoding a
His-Tag fused to the 5' end of the full length gene, because the protein product of this gene
does not contain a signal sequence and is expressed as a cytosolic protein.

PCR Amplification and cloning of DNA sequences containing ORFs for membrane and
secreted proteins from the J99 Strain of Helicobacter pylori.

Sequences chosen (from the list of the DNA sequences of the invention) for cloning
from the J99 strain of H. pylori were prepared for amplification cloning by polymerase
chain reaction (PCR). Synthetic oligonucleotide primers (Table 2) specific for the 5' and 3'
ends of open reading frames (ORFs) were designed and purchased (GibcoBRL Life
Technologies, Gaithersburg, MD, USA). All forward primers (specific for the 5' end of the
sequence) were designed to include an NcoI cloning site at the extreme 5' terminus, except
for H. pylori sequence 4821082 (SEQ ID NO: 212) where NdeI was used. These primers
were designed to permit initiation of protein translation at a methionine residue followed by
a valine residue and the coding sequence for the remainder of the native H. pylori DNA
sequence. An exception is H. pylori sequence 4821082 (SEQ ID NO: 212) where the
initiator methionine is immediately followed by the remainder of the native H. pylori DNA
sequence. All reverse primers (specific for the 3' end of any H. pylori ORF) included an
EcoRI site at the extreme 5' terminus to permit cloning of each H. pylori sequence into the
reading frame of the pET-28b. The pET-28b vector provides sequence encoding an
additional 20 carboxy-terminal amino acids including six histidine residues (at the extreme
C-terminus), which comprise the His-Tag. An exception to the above, as noted earlier, is
the vector construction for the ppiB gene. A synthetic oligonucleotide primer specific for
the 5' end of the ppiB gene encoded a BamHI site at its extreme 5' terminus and the primer for
the 3' end of the ppiB gene encoded an XhoI site at its extreme 5' terminus.



TABLE 2

Oligonucleotide primers used for PCR amplification of H. pylori DNA sequences

Sequence                    Forward primer 5' to 3'                               Reverse primer 5' to 3'

Outer Membrane Proteins
7116626 (SEQ ID NO: 223)    5'-ATATCCATGGTGAGTTTGATGA-3' (SEQ ID NO: 229)         5'-ATGAATTCAATTTTTTATTTTGCCA-3' (SEQ ID NO: 230)
29479681 (SEQ ID NO: 179)   5'-AATTCCATGGTGGGGGCTATG-3' (SEQ ID NO: 231)          5'-ATGAATTCTCGATAGCCAAAATC-3' (SEQ ID NO: 232)

Periplasmic/Secreted Proteins
30100332 (SEQ ID NO: 181)   5'-ATTTCCATGGTCATGTCTCATATT-3' (SEQ ID NO: 233)       5'-ATGAATTCCATCTTTTATTCCAC-3' (SEQ ID NO: 234)
4721061 (SEQ ID NO: 211)    5'-AACCATGGTGATTTTAAGCATTGAAAG-3' (SEQ ID NO: 235)    5'-AAGAATTCCACTCAAAATTTTTTAACAG-3' (SEQ ID NO: 236)

Other Surface Proteins
4821082 (SEQ ID NO: 212)    5'-[...]ATCTTCTAAT-3' (SEQ ID NO: 237)                5'-TGAATTCAACCATTTTAACCCTG-3' (SEQ ID NO: 238)
978477 (SEQ ID NO: 228)     5'-TATACCATGGTGAAATTTTTTCTTTTA-3' (SEQ ID NO: 239)    5'-AGAATTCAATTGCGTCTTGTAAAAG-3' (SEQ ID NO: 240)

Cytoplasmic Protein
ppiB                        5'-TTATGGATCCAAACCAATTAAAACT-3' (SEQ ID NO: 241)      5'-TATCTCGAGTTATAGAGAAG[...]-3' (SEQ ID NO: 242)
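
The primer design rules described above can be checked against the primers themselves. The following sketch uses three primer pairs as read from Table 2 and verifies that each forward primer carries an NcoI site (CCATGG) whose ATG is followed by a valine codon, and that each reverse primer carries an EcoRI site (GAATTC). The helper names and the reduced codon table are illustrative assumptions.

```python
# Check stated primer design rules against three primer pairs from Table 2.
# Helper names and the minimal codon table are illustrative assumptions.

SITES = {"NcoI": "CCATGG", "EcoRI": "GAATTC"}

# Only the codons needed for this check.
CODONS = {"ATG": "Met", "GTG": "Val", "GTC": "Val", "GTA": "Val", "GTT": "Val"}

FORWARD = {
    "7116626": "ATATCCATGGTGAGTTTGATGA",
    "29479681": "AATTCCATGGTGGGGGCTATG",
    "30100332": "ATTTCCATGGTCATGTCTCATATT",
}
REVERSE = {
    "7116626": "ATGAATTCAATTTTTTATTTTGCCA",
    "29479681": "ATGAATTCTCGATAGCCAAAATC",
    "30100332": "ATGAATTCCATCTTTTATTCCAC",
}


def first_two_residues(forward_primer: str) -> tuple[str, str]:
    """Residues encoded by the ATG inside the NcoI site and the next codon."""
    start = forward_primer.index(SITES["NcoI"]) + 2  # ATG begins at CC[ATG]G
    codon1 = forward_primer[start:start + 3]
    codon2 = forward_primer[start + 3:start + 6]
    return CODONS.get(codon1, "?"), CODONS.get(codon2, "?")


if __name__ == "__main__":
    for name, primer in FORWARD.items():
        print(name, "forward starts with", "-".join(first_two_residues(primer)))
    for name, primer in REVERSE.items():
        print(name, "reverse has EcoRI site:", SITES["EcoRI"] in primer)
```

All three forward primers encode Met-Val at the start codon, consistent with the design described in the text.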



Genomic DNA prepared from the J99 strain of H. pylori (ATCC #55679) was used
as the source of template DNA for PCR amplification reactions (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To amplify a
DNA sequence containing an H. pylori ORF, genomic DNA (50 nanograms) was
introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic
oligonucleotide primers (forward and reverse primers) complementary to and flanking a
defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP,
dCTP, dTTP) and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular
Systems, Inc., Branchburg, NJ, USA) in a final volume of 100 microliters. The following
thermal cycling conditions were used to obtain amplified DNA products for each ORF
using a Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal cycler:

Sequences 7116626 (SEQ ID NO: 223), 29479681 (SEQ ID NO: 179), 30100332
(SEQ ID NO: 181), 4821082 (SEQ ID NO: 212) and 978477 (SEQ ID NO: 228):
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Sequence 4721061 (SEQ ID NO: 211):
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 36°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 60°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Conditions for amplification of H. pylori ppiB:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 32°C for 15 sec and 72°C for 1.5 min
25 cycles at 94°C for 15 sec, 56°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.
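
The three cycling programs share one structure and differ only in the two annealing temperatures and the number of main cycles. A minimal sketch of that structure as data, with the total programmed hold time computed for each, is given below. Ramp times between temperatures are not stated in this description and are ignored; the class and function names are illustrative assumptions.

```python
# Thermal cycling programs from the text, encoded as data.
# Times are programmed hold times in seconds; ramp times between
# temperatures are not given in the source and are ignored.

from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    cycles: int
    steps: tuple  # tuple of (temperature in deg C, hold time in seconds)

    def seconds(self) -> int:
        return self.cycles * sum(sec for _, sec in self.steps)


def program(low_anneal: int, high_anneal: int, main_cycles: int) -> list:
    """Initial denaturation, 2 low-stringency cycles, main cycles, final extension."""
    return [
        Stage(1, ((94, 120),)),
        Stage(2, ((94, 15), (low_anneal, 15), (72, 90))),
        Stage(main_cycles, ((94, 15), (high_anneal, 15), (72, 90))),
        Stage(1, ((72, 360),)),
    ]


PROGRAMS = {
    "7116626, 29479681, 30100332, 4821082, 978477": program(30, 55, 23),
    "4721061": program(36, 60, 23),
    "ppiB": program(32, 56, 25),
}


def total_minutes(stages) -> float:
    return sum(stage.seconds() for stage in stages) / 60


if __name__ == "__main__":
    for name, stages in PROGRAMS.items():
        print(f"{name}: {total_minutes(stages):.0f} min of programmed hold time")
```

The two low-temperature cycles let the primers, whose 5' ends carry non-templated restriction sites, anneal before the annealing temperature is raised for the remaining cycles.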




Upon completion of thermal cycling reactions, each sample of amplified DNA was
washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg,
MD, USA). All amplified DNA samples were subjected to digestion with the restriction
endonucleases NcoI and EcoRI (New England BioLabs, Beverly, MA, USA), or in the
case of H. pylori sequence 4821082 (SEQ ID NO: 212), with NdeI and EcoRI (Current
Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).
DNA samples were then subjected to electrophoresis on 1.0% NuSieve (FMC
BioProducts, Rockland, ME, USA) agarose gels. DNA was visualized by exposure to
ethidium bromide and long wave UV irradiation. DNA contained in slices isolated from the
agarose gel was purified using the Bio 101 GeneClean Kit protocol (Bio 101, Vista, CA,
USA).

Cloning of H. pylori DNA sequences into the pET-28b prokaryotic expression vector.

The pET-28b vector was prepared for cloning by digestion with NcoI and EcoRI, or
in the case of H. pylori sequence 4821082 (SEQ ID NO: 212) with NdeI and EcoRI
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds.,
1994). In the case of cloning ppiB, the pET-28a vector, which encodes a His-Tag that can
be fused to the 5' end of an inserted gene, was used and the cloning site prepared for
cloning with the ppiB gene by digestion with BamHI and XhoI restriction endonucleases.
Following digestion, DNA inserts were cloned (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously
digested pET-28b expression vector, except for the amplified insert for ppiB, which was
cloned into the pET-28a expression vector. Products of the ligation reaction were then used
to transform the BL21 strain of E. coli (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) as described below.

Transformation of competent bacteria with recombinant plasmids

Competent bacteria, E. coli strain BL21 or E. coli strain BL21(DE3), were
transformed with recombinant pET expression plasmids carrying the cloned H. pylori
sequences according to standard methods (Current Protocols in Molecular Biology, John Wiley and
Sons, Inc., F. Ausubel et al., eds., 1994). Briefly, 1 microliter of ligation reaction was
mixed with 50 microliters of electrocompetent cells and subjected to a high voltage pulse,
after which samples were incubated in 0.45 milliliters SOC medium (0.5% yeast extract,
2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and 20 mM
glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB agar plates
containing 25 microgram/ml kanamycin sulfate for growth overnight. Transformed
colonies of BL21 were then picked and analyzed to evaluate cloned inserts as described
below.

Identification of recombinant pET expression plasmids carrying H. pylori sequences

Individual BL21 clones transformed with recombinant pET-28b-H. pylori ORFs
were analyzed by PCR amplification of the cloned inserts using the same forward and
reverse primers, specific for each H. pylori sequence, that were used in the original PCR
amplification cloning reactions. Successful amplification verified the integration of the
H. pylori sequences in the expression vector (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).

Isolation and Preparation of plasmid DNA from BL21 transformants

Individual clones of recombinant pET-28b vectors carrying properly cloned
H. pylori ORFs were picked and incubated in 5 ml of LB broth plus 25 microgram/ml
kanamycin sulfate overnight. The following day plasmid DNA was isolated and purified
using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, USA).

Expression of recombinant H. pylori sequences in E. coli 

The pET vector can be propagated in any E. coli K-12 strain, e.g. HMS174, HB101,
JM109, DH5, etc., for the purpose of cloning or plasmid preparation. Hosts for expression
include E. coli strains containing a chromosomal copy of the gene for T7 RNA polymerase.
These hosts are lysogens of bacteriophage DE3, a lambda derivative that carries the lacI
gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA polymerase is
induced by addition of isopropyl-β-D-thiogalactoside (IPTG), and the T7 RNA polymerase
transcribes any target plasmid, such as pET-28b, carrying a T7 promoter and a gene of
interest. Strains used include: BL21(DE3) (Studier, F.W., Rosenberg, A.H., Dunn, J.J., and
Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89).

To express recombinant H. pylori sequences, 50 nanograms of plasmid DNA
isolated as described above was used to transform competent BL21(DE3) bacteria as
described above (provided by Novagen as part of the pET expression system kit). The lacZ
gene (beta-galactosidase) was expressed in the pET-System as described for the H. pylori
recombinant constructions. Transformed cells were cultured in SOC medium for 1 hour,
and the culture was then plated on LB plates containing 25 micrograms/ml kanamycin
sulfate. The following day, bacterial colonies were pooled and grown in LB medium
containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600 nm of 0.5 to
1.0 O.D. units, at which point 1 millimolar IPTG was added to the culture for 3 hours to
induce gene expression of the H. pylori recombinant DNA constructions.

After induction of gene expression with IPTG, bacteria were pelleted by
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets
were resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1
mM EDTA (STE buffer). Cells were then centrifuged at 2000 x g for 20 min at 4°C. Wet
pellets were weighed and frozen at -80°C until ready for protein purification.

III. Purification of recombinant proteins from E. coli 
Analytical Methods 

The concentrations of purified protein preparations were quantified
spectrophotometrically using absorbance coefficients calculated from amino acid content
(Perkins, S.J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations were also
measured by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254, and
Lowry, O.H., Rosebrough, N.J., Farr, A.L. & Randall, R.J. (1951) J. Biol. Chem. 193, pages
265-275, using bovine serum albumin as a standard.
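
A composition-based absorbance calculation of this kind can be sketched as follows. Note the substitution: the per-residue coefficients used here are the widely cited values of Pace et al. (1995), chosen as a stand-in, and are not necessarily those of the Perkins method cited above. The example protein composition, molecular weight and absorbance reading are hypothetical.

```python
# Protein concentration from A280 using a composition-based molar
# absorption coefficient (Beer-Lambert law). The residue coefficients are
# those of Pace et al. (1995), used as a stand-in; they are not taken from
# the source. The example protein below is hypothetical.

EPS_TRP = 5500      # M^-1 cm^-1 per tryptophan at 280 nm
EPS_TYR = 1490      # M^-1 cm^-1 per tyrosine at 280 nm
EPS_CYSTINE = 125   # M^-1 cm^-1 per disulfide bond at 280 nm


def molar_absorptivity(n_trp: int, n_tyr: int, n_cystine: int = 0) -> int:
    """Estimated molar absorption coefficient at 280 nm (M^-1 cm^-1)."""
    return n_trp * EPS_TRP + n_tyr * EPS_TYR + n_cystine * EPS_CYSTINE


def concentration_mg_per_ml(a280: float, epsilon: float,
                            mol_weight_da: float, path_cm: float = 1.0) -> float:
    """c = A / (epsilon * l), converted from mol/L to mg/ml via molecular weight."""
    return a280 / (epsilon * path_cm) * mol_weight_da


if __name__ == "__main__":
    # Hypothetical protein: 2 Trp, 5 Tyr, no disulfides, 20 kDa, A280 = 0.5
    eps = molar_absorptivity(n_trp=2, n_tyr=5)
    print("epsilon:", eps, "M^-1 cm^-1")
    print("concentration:", round(concentration_mg_per_ml(0.5, eps, 20_000), 3), "mg/ml")
```

Because the coefficient depends only on residue counts, it can be computed directly from the predicted amino acid sequence of each recombinant protein.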

SDS-polyacrylamide gels (12% or 4.0 to 25% acrylamide gradient gels) were
purchased from BioRad (Hercules, CA, USA), and stained with Coomassie blue.
Molecular weight markers included rabbit skeletal muscle myosin (200 kDa), E. coli
β-galactosidase (116 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum albumin
(66.2 kDa), ovalbumin (45 kDa), bovine carbonic anhydrase (31 kDa), soybean trypsin
inhibitor (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin (6.5 kDa).

1. Purification of soluble proteins 

All steps were carried out at 4°C. Frozen cells were thawed, resuspended in 5
volumes of lysis buffer (20 mM Tris, pH 7.9, 0.5 M NaCl, 5 mM imidazole with 10%
glycerol, 0.1% 2-mercaptoethanol, 200 µg/ml lysozyme, 1 mM phenylmethylsulfonyl
fluoride (PMSF), and 10 µg/ml each of leupeptin, aprotinin, pepstatin,
L-1-chloro-3-[4-tosylamido]-7-amino-2-heptanone (TLCK),
L-1-chloro-3-[4-tosylamido]-4-phenyl-2-butanone (TPCK), and soybean trypsin inhibitor),
and ruptured by several passages through
a small volume microfluidizer (Model M-110S, Microfluidics International Corporation,
Newton, MA). The resultant homogenate was made 0.1% Brij 35, and centrifuged at
100,000 x g for 1 hour to yield a clear supernatant (crude extract).

Following filtration through a 0.8 µm Supor filter (Gelman Sciences, FRG) the
crude extract was loaded directly onto a Ni2+-nitrilotriacetate-agarose (NTA) column with a 5
milliliter bed volume (Hochuli, E., Döbeli, H., and Schacher, A. (1987) J. Chromatography
411, 177-184) pre-equilibrated in lysis buffer containing 10% glycerol, 0.1% Brij 35 and 1
mM PMSF. The column was washed with 250 ml (50 bed volumes) of lysis buffer
containing 10% glycerol, 0.1% Brij 35, and was eluted with sequential steps of lysis
buffer containing 10% glycerol, 0.05% Brij 35, 1 mM PMSF, and 20, 100, 200, and 500
mM imidazole in succession. Fractions were monitored by absorbance at 280 nm, and
peak fractions were analyzed by SDS-PAGE. Fractions containing the recombinant protein
eluted at 100 mM imidazole.

Recombinant beta-galactosidase (lacZ) and peptidyl-prolyl cis-trans isomerase (ppiB)

Fractions containing the recombinant proteins from the Ni2+-NTA-agarose columns
were pooled and then concentrated to approximately 5 ml by centrifugal filtration
(Centriprep-10, Amicon, MA), and loaded directly onto a 180-ml column (1.6 x 91 cm) of
Sephacryl S-100 HR gel filtration medium equilibrated in Buffer A (10 mM Hepes, pH 7.5,
150 mM NaCl, 0.1 mM EGTA) and run in Buffer A at 18 ml/h. Fractions containing the
recombinant protein were identified by absorbance at 280 nm and analyzed by SDS-PAGE.
Fractions were pooled and concentrated by centrifugal filtration.

Recombinant protein 7116626 (SEQ ID NO: 223) 

Fractions containing the recombinant protein from the Ni2+-NTA-agarose column
were pooled and dialyzed overnight against 1 liter of dialysis buffer (10 mM MOPS, pH
6.5, 50 mM NaCl, 0.1 mM EGTA, 0.02% Brij 35 and 1 mM PMSF). In the morning, a fine
white precipitate was removed by centrifugation and the resulting supernatant was loaded
onto an 8 ml (8 x 75 mm) MonoS high performance liquid chromatography column
(Pharmacia Biotechnology, Inc., Piscataway, NJ, USA) equilibrated in buffer B (10 mM
MOPS, pH 6.5, 0.1 mM EGTA) containing 50 mM NaCl. The column was washed with 10
bed volumes of buffer B containing 50 mM NaCl, and developed with a 50-ml linear
gradient of increasing NaCl (50 to 500 mM). Recombinant protein 7116626 (SEQ ID NO:
223) eluted as a sharp peak at 300 mM NaCl.
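
For orientation, the position of the 300 mM elution point within the 50-ml gradient follows from simple linear interpolation, as sketched below. The calculation assumes an ideal linear gradient and ignores any system dead volume or mixing delay, neither of which is described here; the function names are illustrative.

```python
# Linear salt gradient: concentration as a function of delivered volume.
# Assumes an ideal gradient with no dead volume or mixing delay.

def gradient_concentration(volume_ml: float, start_mm: float = 50.0,
                           end_mm: float = 500.0, gradient_ml: float = 50.0) -> float:
    """NaCl concentration (mM) after volume_ml of the gradient has been delivered."""
    if not 0.0 <= volume_ml <= gradient_ml:
        raise ValueError("volume outside the gradient")
    return start_mm + (end_mm - start_mm) * volume_ml / gradient_ml


def volume_at(conc_mm: float, start_mm: float = 50.0,
              end_mm: float = 500.0, gradient_ml: float = 50.0) -> float:
    """Gradient volume (ml) at which the given NaCl concentration is reached."""
    if not start_mm <= conc_mm <= end_mm:
        raise ValueError("concentration outside the gradient")
    return (conc_mm - start_mm) / (end_mm - start_mm) * gradient_ml


if __name__ == "__main__":
    print(f"300 mM NaCl is reached {volume_at(300.0):.1f} ml into the gradient")
```

Under these assumptions the protein elutes a little past the midpoint of the gradient, at about 27.8 ml.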

2. Purification of insoluble proteins from inclusion bodies

The following steps were carried out at 4°C. Cell pellets were resuspended in lysis
buffer with 10% glycerol, 200 µg/ml lysozyme, 5 mM EDTA, 1 mM PMSF and 0.1%
2-mercaptoethanol. After passage through the cell disrupter, the resulting homogenate was
made 0.2% deoxycholate, stirred 10 minutes, then centrifuged at 20,000 x g for 30 min.
The pellets were washed with lysis buffer containing 10% glycerol, 10 mM EDTA, 1%
Triton X-100, 1 mM PMSF and 0.1% 2-mercaptoethanol, followed by several washes with
lysis buffer containing 1 M urea, 1 mM PMSF and 0.1% 2-mercaptoethanol. The resulting
white pellet was composed primarily of inclusion bodies, free of unbroken cells and
membranous materials.

Recombinant proteins 30100332 (SEQ ID NO: 181), 4721061 (SEQ ID NO: 211)

The following steps were carried out at room temperature. Purified inclusion bodies
were dissolved in 20 ml 8.0 M urea in lysis buffer with 1 mM PMSF and 0.1%
2-mercaptoethanol, and incubated at room temperature for 1 hour. Materials that did not
dissolve were removed by centrifugation. The clear supernatant was filtered, then loaded
onto a Ni2+-NTA agarose column pre-equilibrated in 8.0 M urea in lysis buffer. The
column was washed with 250 ml (50 bed volumes) of lysis buffer containing 8 M urea, 1.0
mM PMSF and 0.1% 2-mercaptoethanol, and developed with sequential steps of lysis
buffer containing 8 M urea, 1 mM PMSF, 0.1% 2-mercaptoethanol and 20, 100, 200, and
500 mM imidazole in succession. Fractions were monitored by absorbance at 280 nm,
and peak fractions were analyzed by SDS-PAGE. Fractions containing the recombinant
protein eluted at 100 mM imidazole.



Recombinant proteins 29479681 (SEQ ID NO: 179), 978477 (SEQ ID NO: 228)

The pellet containing the inclusion bodies was solubilized in buffer B containing 8
M urea, 1 mM PMSF and 0.1% 2-mercaptoethanol, and incubated for 1 hour at room
temperature. Insoluble materials were removed by centrifugation at 20,000 x g for 30 min,
and the cleared supernatant was loaded onto a 15 ml (1.6 x 7.5 cm) SP-Sepharose column
pre-equilibrated in buffer B, 6 M urea, 1 mM PMSF, 0.1% 2-mercaptoethanol. After
washing the column with 10 bed volumes, the column was developed with a linear gradient
from 0 to 500 mM NaCl.


Dialysis and concentration of protein samples 

Urea was removed slowly from the protein samples by dialysis against Tris-buffered
saline (TBS; 10 mM Tris pH 8.0, 150 mM NaCl) containing 0.5% deoxycholate
(DOC) with sequential reduction in urea concentration as follows: 6 M, 4 M, 3 M, 2 M, 1 M,
0.5 M and finally TBS without any urea. Each dialysis step was conducted for a minimum
of 4 hours at room temperature.
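As a planning aid, the stepwise urea reduction can be laid out as a simple schedule. The sketch below is illustrative only: it uses the stated minimum of 4 hours for every step, and the names are hypothetical.

```python
# Urea concentration (M) of each successive dialysis buffer; 0.0 is TBS without urea.
UREA_STEPS_M = [6.0, 4.0, 3.0, 2.0, 1.0, 0.5, 0.0]
MIN_HOURS_PER_STEP = 4


def dialysis_schedule(steps=UREA_STEPS_M, hours_per_step=MIN_HOURS_PER_STEP):
    """Return ([(start_hour, urea_molar), ...], total_hours) for the step series."""
    schedule = []
    elapsed = 0
    for urea_molar in steps:
        schedule.append((elapsed, urea_molar))
        elapsed += hours_per_step
    return schedule, elapsed


if __name__ == "__main__":
    schedule, total_hours = dialysis_schedule()
    for start_hour, urea_molar in schedule:
        print(f"t = {start_hour:2d} h: dialyze against TBS/DOC with {urea_molar} M urea")
    print("minimum total dialysis time:", total_hours, "h")
```

With seven buffers at the minimum step length, the whole series takes at least 28 hours.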

After dialysis, samples were concentrated by pressure filtration using Amicon
stirred-cells. Protein concentrations were measured using the methods of Perkins ((1986)
Eur. J. Biochem. 157, 169-180), Bradford ((1976) Anal. Biochem. 72, 248-254) and Lowry
((1951) J. Biol. Chem. 193, 265-275).

The recombinant proteins purified by the methods described above are summarized 
in Table 3 below. 



TABLE 3

J99 Sequence Identifier | Homolog identified by Blast | Gene symbol of homolog | Bacterial fraction used to purify recombinant proteins | Method of purification | Relative MW on SDS-PAGE gel | Final concentration of purified protein | Composition of buffer

Outer Membrane Proteins
7116626 (SEQ ID NO: 223) | P26093 | e(P4) | Soluble fraction | His-Tag | 29 kDa | 0.8 mg/ml | A
- | - | - | - | - | - | 1.85 mg/ml | C
29479681 (SEQ ID NO: 179) | P13036 | fecA | Inclusion bodies | SP-Sepharose | 23 kDa | 2.36 mg/ml | B
- | - | - | - | - | - | 0.5 mg/ml | B
- | - | - | - | - | - | as dry pellet | -
- | - | - | - | gel filtration S100 HR | - | - | -

Periplasmic/Secreted Protein
3010032 (SEQ ID NO: 181) | P23847 | dppA | Inclusion bodies | His-Tag | 11 kDa | 2.88 mg/ml | B
4721061 (SEQ ID NO: 211) | P36175 | GCP | Inclusion bodies | His-Tag | 38 kDa | 2.8 mg/ml | B

Other Surface Proteins
4821082 (SEQ ID NO: 212) | P08089 | M protein | Inclusion bodies | His-Tag | 20 kDa | 1.16 mg/ml | B
978477 (SEQ ID NO: 228) | L28919 | FBP54 | Inclusion bodies | SP-Sepharose | 44 kDa | 2.56 mg/ml | B
- | - | - | - | - | - | 0.3 mg/ml | B

Control Proteins with His-Tag
- | P00722 | lacZ | Soluble fraction | His-Tag | 116 kDa | 10 mg/ml | A
- | - | - | - | gel filtration S200 HR | - | - | -
- | - | ppiB | Soluble fraction | His-Tag | 21 kDa | 4.4 mg/ml | A
- | - | - | - | gel filtration S100 HR | - | - | -

A = 10 mM HEPES pH 7.5, 150 mM NaCl, 0.1 mM EGTA
B = 10 mM Tris pH 8.0, 150 mM NaCl, 0.5% DOC
C = 10 mM MOPS pH 6.5, 300 mM NaCl, 0.1 mM EGTA



IV. Analysis of H. pylori proteins as Vaccine candidates

To investigate the immunomodulatory effect of H. pylori proteins, a mouse/H.
pylori model was used. This model mimics the human H. pylori infection in many
respects. The focus is on the effect of oral immunization in H. pylori infected animals in
order to test the concept of therapeutic oral immunotherapy.


Animals 

Female SPF BALB/c mice were purchased from Bomholt Breeding center 
(Denmark). They were kept in ordinary makrolon cages with free supply of water and food. 
The animals were 4-6 weeks old at arrival. 


Infection 

After a minimum of one week of acclimatization, the animals were infected with a
type 2 strain (VacA negative) of H. pylori (strain AH244, originally isolated from an ulcer
patient). In our hands, this strain has earlier proven to be a good colonizer of the mouse
stomach. The bacteria were grown overnight in Brucella broth supplemented with 10%
fetal calf serum, at 37°C in a microaerophilic atmosphere (10% CO2, 5% O2). The animals
were given an oral dose of omeprazole (400 µmol/kg) and 3-5 h after this an oral
inoculation of H. pylori in broth (approximately 10^8 cfu/animal). Positive take of the
infection was checked in some animals 2-3 weeks after the inoculation.

Antigens

Recombinant H. pylori antigens were chosen based on their association with
externally exposed H. pylori cell membrane. These antigens were selected from the
following groups: (1.) Outer Membrane Proteins; (2.) Periplasmic/Secreted proteins; (3.)
Outer Surface proteins; and (4.) Inner Membrane proteins. All recombinant proteins were
constructed with a hexa-HIS tag for purification reasons and the non-Helicobacter pylori
control protein (β-galactosidase from E. coli; LacZ) was constructed in the same way.

All antigens were given in a soluble form, i.e. dissolved in either a HEPES buffer or
in a buffer containing 0.5% deoxycholate (DOC).
The antigens are listed in Table 4 below.

Table 4

Helicobacter pylori proteins

Outer membrane proteins
SEQ ID NO:179
SEQ ID NO:223

Periplasmic/Secreted proteins
SEQ ID NO:181
SEQ ID NO:211

Other cell envelope proteins
SEQ ID NO:212
SEQ ID NO:228

Control proteins
β-galactosidase (LacZ)

Immunizations 

Ten animals in each group were immunized 4 times over a 34 day period
(day 1, 15, 25 and 35). Purified antigens in solution or suspension were given at a dose of
100 µg/mouse. As an adjuvant, the animals were also given 10 µg/mouse of Cholera toxin
(CT) with each immunization. Omeprazole (400 µmol/kg) was given orally to the animals
3-5 h prior to immunization as a way of protecting the antigens from acid degradation.
Infected control animals received HEPES buffer + CT or DOC buffer + CT. Animals were
sacrificed 2-4 weeks after final immunization. A general outline of the study is shown in
Table 5 below.

Table 5

Study outline, therapeutic immunization:

Mice were all infected with H. pylori strain AH244 at day 30. Proteins are listed by their SEQ ID NOs.

Substance (n=10) | Mouse strain | Dose/mouse | Dates for dosing
1. Controls, PBS | Balb/c | 0.3 ml | 0, 14, 24, 34
2. Cholera toxin, 10 µg | Balb/c | 0.3 ml | 0, 14, 24, 34
3. Protein 179, 100 µg + CT 10 µg | Balb/c | 0.3 ml | 0, 14, 24, 34
4. Protein 181, 100 µg + CT 10 µg | Balb/c | 0.3 ml | 0, 14, 24, 34
5. Protein 211, 100 µg + CT 10 µg | Balb/c | 0.3 ml | 0, 14, 24, 34
6. Protein 212, 100 µg + CT 10 µg | Balb/c | 0.3 ml | 0, 14, 24, 34
7. Protein 228, 100 µg + CT 10 µg | Balb/c | 0.3 ml | 0, 14, 24, 34
8. Protein 223, 100 µg + CT 10 µg | Balb/c | 0.3 ml | 0, 14, 24, 34


Analysis of infection 

Mucosal infection: The mice were sacrificed by CO2 and cervical dislocation. The
abdomen was opened and the stomach removed. After cutting the stomach along the
greater curvature, it was rinsed in saline. The mucosa from the antrum and corpus of an
area of 25 mm² was scraped separately with a surgical scalpel. The mucosa scraping was
suspended in Brucella broth and plated onto Blood Skirrow selective plates. The plates
were incubated under microaerophilic conditions for 3-5 days and the number of colonies
was counted. The identity of H. pylori was ascertained by urease and catalase test and by
direct microscopy or Gram staining.

The urease test was performed essentially as follows. The reagent, Urea Agar Base
Concentrate, was purchased from DIFCO Laboratories, Detroit, MI (Catalog # 0284-61-3).
Urea agar base concentrate was diluted 1:10 with water. 1 ml of the diluted concentrate
was mixed with 100-200 µl of actively growing H. pylori cells. Color change to magenta
indicated that cells were urease positive.

The catalase test was performed essentially as follows. The reagent,
N,N,N',N'-Tetramethyl-p-Phenylenediamine, was purchased from Sigma, St. Louis, MO (Catalog #
T3134). A solution of the reagent (1% w/v in water) was prepared. H. pylori cells were
swabbed onto Whatman filter paper and overlaid with the 1% solution. Color change to
dark blue indicated that the cells were catalase positive.

Serum antibodies: Serum was prepared from all mice from blood drawn by heart
puncture. Serum antibodies were identified by regular ELISA techniques, where the
specific antigens of Helicobacter pylori were plated.

Mucosal antibodies: Gentle scrapings of a defined part of the corpus and of 4 cm of
duodenum were performed in 50% of the mice in order to detect the presence of antibodies
in the mucus. The antibody titers were determined by regular ELISA technique as for
serum antibodies.

Statistical analysis: Wilcoxon-Mann-Whitney sign rank test was used for
determination of significant effects of the antigens on Helicobacter pylori colonization.
P<0.05 was considered significant. Because the antrum is the major colonization site for
Helicobacter, most emphasis was put upon changes in the antral colonization.
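A comparison of this kind can be sketched as follows, assuming the rank-sum (Mann-Whitney U) form of the test for two independent groups, average ranks for ties, and a normal approximation without continuity correction. The colony counts are invented for illustration and are not data from this study; the function name is hypothetical.

```python
import math


def mann_whitney_u(x, y):
    """Return (U, two-sided p) for two independent samples.

    Uses average ranks for tied values and a tie-corrected normal approximation.
    """
    pooled = sorted((value, group) for group, values in enumerate((x, y)) for value in values)
    n = len(pooled)
    n1, n2 = len(x), len(y)
    rank_sum_x = 0.0
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        average_rank = (i + 1 + j) / 2.0  # mean of ranks i+1 .. j
        ties = j - i
        tie_term += ties ** 3 - ties
        rank_sum_x += average_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    u_x = rank_sum_x - n1 * (n1 + 1) / 2.0
    u = min(u_x, n1 * n2 - u_x)
    mean_u = n1 * n2 / 2.0
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        return u, 1.0
    z = (u - mean_u) / math.sqrt(var_u)
    return u, math.erfc(abs(z) / math.sqrt(2.0))


if __name__ == "__main__":
    # Hypothetical antral colony counts per animal (illustration only).
    control = [120, 95, 150, 200, 80, 170, 110, 140, 160, 130]
    treated = [10, 0, 25, 5, 40, 0, 15, 30, 60, 20]
    u, p = mann_whitney_u(control, treated)
    print("U =", u, "significant at P<0.05:", p < 0.05)
```

For groups of 8 to 10 animals an exact test would normally be preferred over the normal approximation; the approximation is used here only to keep the sketch dependency-free.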

Results 

Antibodies in sera: All antigens tested given together with CT gave rise to a
measurable specific titer in serum. The highest responses were seen with SEQ ID
NOs: 223, 211, and 212 (see Figure 2).

Antibodies in mucus: In the mucus scrapings, specific antibodies against all
antigens tested were seen. By far the strongest response was seen with SEQ ID NO: 181,
followed by 223 (see Figure 3).

Therapeutic immunization effects: 
All control animals (BALB/c mice) were well colonized with H. pylori (strain
AH244) in both antrum and corpus of the stomach. Of the antigens tested, 2 proteins (SEQ
ID NOs: 211 and 212) gave a good and significant reduction and/or eradication of the H.
pylori infection. The degree of colonization of the antrum was lower following
immunization with SEQ ID NOs: 228 and 223 compared to control. The effect of SEQ ID
NOs: 179 and 181 did not differ from control. The control protein lacZ, i.e. the non-H.
pylori protein, had no eradication effect and in fact was associated with higher Helicobacter
colonization compared to the HEPES + CT control. All data are shown in Figures 4 and 5 for
proteins dissolved in HEPES and DOC respectively. Data are shown as geometric mean values,
n=8-10. Wilcoxon-Mann-Whitney sign rank test: * = p<0.05; x/10 = number of mice showing
eradication of H. pylori over the total number of mice examined.
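Geometric means of colony counts can be computed as sketched below. How animals with zero colonies (eradication) were handled is not stated here; the sketch assumes a common convention of adding an offset of 1 before taking logarithms, which is an assumption and not the documented method. The function name is hypothetical.

```python
import math


def geometric_mean(counts, offset=1.0):
    """Geometric mean of (count + offset), minus offset.

    The offset keeps zero counts from making the logarithm undefined
    (assumed convention; use offset=0.0 for strictly positive data).
    """
    if not counts:
        raise ValueError("no counts supplied")
    logs = [math.log(count + offset) for count in counts]
    return math.exp(sum(logs) / len(logs)) - offset


if __name__ == "__main__":
    # Hypothetical colony counts for one group, including two eradicated animals.
    print(geometric_mean([0, 0, 12, 40, 150, 8, 25, 60]))
```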

The data presented indicate that all of the H. pylori associated proteins included in
this study, when used as oral immunogens in conjunction with the oral adjuvant CT,
resulted in stimulation of an immune response as measured by specific serum and mucosal
antibodies. A majority of the proteins led to a reduction, and in some cases complete
clearance, of the colonization of H. pylori in this animal model. It should be noted that the
reduction or clearance was due to heterologous protection rather than homologous
protection (the polypeptides were based on the H. pylori J99 strain sequence and used in the
therapeutic immunization studies against a different (AH244) challenge strain), indicating
the vaccine potential against a wide variety of H. pylori strains.

The highest colonization in the antrum was seen in animals treated with the
non-Helicobacter protein LacZ, indicating that the effects seen with the Helicobacter pylori
antigens were specific.

Taken together these data strongly support the use of these H. pylori proteins in a
pharmaceutical formulation for the use in humans to treat and/or prevent H. pylori
infections.

V. Sequence Variance Analysis of genes in Helicobacter pylori strains

Three genes were cloned and sequenced from several strains of H. pylori to
compare the DNA and deduced amino acid sequences. This information was used to
determine the sequence variation between the H. pylori strain, J99, and other H. pylori
strains isolated from human patients.

Preparation of Chromosomal DNA.

Cultures of H. pylori strains (as listed in Table 8) were grown in BLBB (1%
Tryptone, 1% Peptamin, 0.1% Glucose, 0.2% Yeast Extract, 0.5% Sodium Chloride, 5%
Fetal Bovine Serum) to an OD600 of 0.2. Cells were centrifuged in a Sorvall RC-3B at
3500 x g at 4°C for 15 minutes and the pellet resuspended in 0.95 ml of 10 mM Tris-HCl,
0.1 mM EDTA (TE). Lysozyme was added to a final concentration of 1 mg/ml along with
SDS to 1% and RNase A + T1 to 0.5 mg/ml and 5 units/ml respectively, and incubated at
37°C for one hour. Proteinase K was then added to a final concentration of 0.4 mg/ml and
the sample was incubated at 55°C for more than one hour. NaCl was added to the sample
to a concentration of 0.65 M, mixed carefully, and 0.15 ml of 10% CTAB in 0.7 M NaCl
(final is 1% CTAB/70 mM NaCl) was added followed by incubation at 65°C for 20
minutes. At this point, the samples were extracted with chloroform:isoamyl alcohol,
extracted with phenol, and extracted again with chloroform:isoamyl alcohol. DNA was
precipitated with either EtOH (1.5 x volumes) or isopropanol (0.6 x volumes) at -70°C for
10 minutes, washed in 70% EtOH and resuspended in TE.


PCR Amplification and cloning. 

Genomic DNA prepared from twelve strains of Helicobacter pylori was used as the
source of template DNA for PCR amplification reactions (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). To amplify a DNA
sequence containing an H. pylori ORF, genomic DNA (10 nanograms) was introduced into
a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic oligonucleotide primers
(forward and reverse primers, see Table 6) complementary to and flanking a defined H.
pylori ORF, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP) and
0.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc.,

Branchburg, NJ, USA) in a final volume of 20 microliters in duplicate reactions.

Table 6

Outer membrane proteins | Forward primer 5' to 3' | Reverse primer 5' to 3'
SEQ ID NO:223 | 5'-ATATCCATGGTGAGTTTGATGA-3' (SEQ ID NO:243) | 5'-ATGAATTCAATTTTTTATTTTGCCA-3' (SEQ ID NO:244)
SEQ ID NO:179 | 5'-AATTCCATGGCTATCCAAATCCG-3' (SEQ ID NO:245) | 5'-ATGAATTCGCCAAAATCGTAGTATr-3' (SEQ ID NO:246)
SEQ ID NO:199 | 5'-GATACCATGGAATTTATGAAAAAG-3' (SEQ ID NO:247) | 5'-TGAATTCGAAAAAGTGTAGTTATAC-3' (SEQ ID NO:248)
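A minimal sketch, not part of the original protocol, that scans the Table 6 primers for two restriction enzyme recognition sequences. CCATGG (NcoI) and GAATTC (EcoRI) are standard recognition sequences; that the primers were designed around these sites is an inference from the sequences themselves and is not stated in this section. The final base of SEQ ID NO:246 is unclear in the source and is written as N here.

```python
# Standard recognition sequences for the two enzymes checked in this sketch.
SITES = {"NcoI": "CCATGG", "EcoRI": "GAATTC"}

# Primer sequences as listed in Table 6 (5' to 3').
PRIMERS = {
    "SEQ ID NO:243": "ATATCCATGGTGAGTTTGATGA",
    "SEQ ID NO:244": "ATGAATTCAATTTTTTATTTTGCCA",
    "SEQ ID NO:245": "AATTCCATGGCTATCCAAATCCG",
    "SEQ ID NO:246": "ATGAATTCGCCAAAATCGTAGTATN",  # last base illegible in source
    "SEQ ID NO:247": "GATACCATGGAATTTATGAAAAAG",
    "SEQ ID NO:248": "TGAATTCGAAAAAGTGTAGTTATAC",
}


def find_sites(sequence, sites=SITES):
    """Map each enzyme whose motif occurs in sequence to the 0-based motif start."""
    sequence = sequence.upper()
    return {name: sequence.find(motif) for name, motif in sites.items() if motif in sequence}


if __name__ == "__main__":
    for primer_id, sequence in PRIMERS.items():
        print(primer_id, find_sites(sequence))
```

Each forward primer in the table carries a CCATGG motif near its 5' end and each reverse primer a GAATTC motif, which is what the scan reports.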



The following thermal cycling conditions were used to obtain amplified DNA
products for each ORF using a Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal
cycler:

Sequences (by SEQ ID NO:) 223 and 199:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min
23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min
Reactions were concluded at 72°C for 6 minutes.

Sequence (by SEQ ID NO:) 179:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 20 sec and 72°C for 2 min
25 cycles at 94°C for 15 sec, 55°C for 20 sec and 72°C for 2 min
Reactions were concluded at 72°C for 8 minutes.
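The two programs above can be written down as data and their total hold times summed, as in the sketch below. Temperature ramp times are instrument-dependent and are ignored, so the figures are lower bounds on run time; the data layout and names are hypothetical.

```python
def program_seconds(initial_s, stages, final_s):
    """Total programmed hold time in seconds.

    stages is a list of (cycles, (denature_s, anneal_s, extend_s)) tuples.
    """
    cycling = sum(cycles * sum(step_seconds) for cycles, step_seconds in stages)
    return initial_s + cycling + final_s


# Programs transcribed from the conditions listed above (times in seconds).
PROGRAM_223_199 = dict(initial_s=120, stages=[(2, (15, 15, 90)), (23, (15, 15, 90))], final_s=360)
PROGRAM_179 = dict(initial_s=120, stages=[(2, (15, 20, 120)), (25, (15, 20, 120))], final_s=480)

if __name__ == "__main__":
    for name, program in (("SEQ ID NOs 223 and 199", PROGRAM_223_199), ("SEQ ID NO 179", PROGRAM_179)):
        print(name, program_seconds(**program) / 60.0, "min of programmed holds")
```

The first two low-stringency cycles (30°C annealing) and the later cycles (55°C annealing) have the same step durations within each program, so only the cycle counts differ between stages.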

Upon completion of thermal cycling reactions, each pair of samples was combined
and used directly for cloning into the pCR cloning vector as described below.

Cloning of H. pylori DNA sequences into the pCR TA cloning vector.

All amplified inserts were cloned into the pCR 2.1 (pCRII in the case of H. pylori
sequence 223) vector by the method described in the Original TA cloning kit (Invitrogen,
San Diego, CA). Products of the ligation reaction were then used to transform the
TOP10F' (INVαF' in the case of H. pylori sequence 223) strain of E. coli as described
below.

Transformation of competent bacteria with recombinant plasmids

Competent bacteria, E. coli strain TOP10F' or E. coli strain INVαF', were
transformed with recombinant pCR expression plasmids carrying the cloned H. pylori
sequences according to standard methods (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). Briefly, 2 microliters of 0.5
micromolar BME was added to each vial of 50 microliters of competent cells.
Subsequently, 2 microliters of ligation reaction was mixed with the competent cells and
incubated on ice for 30 minutes. The cells and ligation mixture were then subjected to a
"heat shock" at 42°C for 30 seconds, and were subsequently placed on ice for an additional
2 minutes, after which samples were incubated in 0.45 milliliters SOC medium (0.5%
yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4
and 20 mM glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB
agar plates containing 25 micrograms/ml kanamycin sulfate or 100 micrograms/ml
ampicillin for growth overnight. Transformed colonies of TOP10F' or INVαF' were then
picked and analyzed to evaluate cloned inserts as described below.

Identification of recombinant PCR plasmids carrying H. pylori sequences

Individual TOP10F' or INVαF' clones transformed with recombinant pCR-H. pylori
ORFs were analyzed by PCR amplification of the cloned inserts using the same forward
and reverse primers, specific for each H. pylori sequence, that were used in the original
PCR amplification cloning reactions. Successful amplification verified the integration of
the H. pylori sequences in the cloning vector (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994).

Individual clones of recombinant pCR vectors carrying properly cloned H. pylori
ORFs were picked for sequence analysis. Sequence analysis was performed on ABI
Sequencers using standard protocols (Perkin Elmer) using vector-specific primers (as found
in pCRII or pCR2.1, Invitrogen, San Diego, CA) and sequencing primers specific to the
ORF as listed in Table 7 below.

Table 7

Oligonucleotide primers used for sequencing of H. pylori DNA sequences.

Outer membrane proteins | Forward primers 5' to 3' | Reverse primers 5' to 3'
SEQ ID NO:223 | 5'-TTGAACACTTTTGATTATGCGG-3' (SEQ ID NO:249); 5'-GGATTATGCGATTG 11 1 IACAAG-3' (SEQ ID NO:250) | 5'-GTCTTTAGCAAAAATGGCGTC-3' (SEQ ID NO:251); 5'-AATGAGCGTAAGAGAGCCTTC-3' (SEQ ID NO:252)
SEQ ID NO:179 | 5'-CTTATGGGGGTATTGTCA-3' (SEQ ID NO:253); 5'-AGCATGTGGGTATCCAGC-3' (SEQ ID NO:254) | 5'-AGGTTGTTGCCTAAAGACT-3' (SEQ ID NO:255); 5'-CTGCCTCCACCTTTGATC-3' (SEQ ID NO:256)
SEQ ID NO:199 | 5'-ACCAATATCAATTGGCACT-3' (SEQ ID NO:257); 5VVCTTGGAAAAGCTCTGCA-3' (SEQ ID NO:258) | 5'-CTTGCTTGTCATATCTAGC-3' (SEQ ID NO:259); 5'-GTTGAAGTGTTGGTGCTA-3' (SEQ ID NO:260)
- | 5'-CAAGCAAGTGGTTTGGTTTTAG-3' (SEQ ID NO:261); 5'-TGGAAAGAGCAAATCATTGAAG-3' (SEQ ID NO:262) | 5'-GCCCATAATCAAAAAGCCCAT-3' (SEQ ID NO:263); 5'-CTAAAACCAAACCACTTGCTTGTC-3' (SEQ ID NO:264)
Vector Primers | 5'-GTAAAACGACGGCCAG-3' (SEQ ID NO:265) | 5'-CAGGAAACAGCTATGAC-3' (SEQ ID NO:266)



Results 



The PCR error rate for the experiments described herein was determined to be 2
base changes out of 4485 bases, which is equivalent to an estimated error rate of less than
or equal to 0.04%.
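The stated error rate follows directly from the two counts given, as the sketch below shows. The second function, which scales the rate to a region of a given length, is an illustrative extension and not a calculation reported in the text; both function names are hypothetical.

```python
CHANGES_OBSERVED = 2     # base changes attributed to PCR
BASES_SEQUENCED = 4485   # bases examined


def error_rate(changes=CHANGES_OBSERVED, bases=BASES_SEQUENCED):
    """Per-base error rate as a fraction."""
    return changes / bases


def expected_pcr_changes(region_nt, rate=None):
    """Expected number of PCR-introduced changes in a region of region_nt bases."""
    if rate is None:
        rate = error_rate()
    return region_nt * rate


if __name__ == "__main__":
    print(f"error rate: {100.0 * error_rate():.3f}%")
    print(f"expected changes in a 696 nt region: {expected_pcr_changes(696):.2f}")
```

At this rate, fewer than one PCR-derived change would be expected in any single region of a few hundred nucleotides, so most of the strain-to-strain differences observed over such regions would reflect genuine sequence variation.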

DNA sequence analysis was performed on four different open reading frames
identified as genes and amplified by PCR methods from a dozen different strains of the
bacterium Helicobacter pylori. The deduced amino acid sequences of three of the four
open reading frames that were selected for this study showed statistically significant
BLAST homology to defined proteins present in other bacterial species. Those ORFs
included: SEQ ID NO:223, homologous to lipoprotein e (P4) present in the outer
membrane of H. influenzae; and SEQ ID NO:179, homologous to fecA, an outer membrane
receptor in iron (III) dicitrate transport in E. coli. SEQ ID NO:199 was identified as an
unknown open reading frame, because it showed low homology with sequences in the
public databases.

To assess the extent of conservation or variance in the ORFs across various strains
of H. pylori, changes in DNA sequence and the deduced protein sequence were compared
to the DNA and deduced protein sequences found in the J99 strain of H. pylori (see Table 8
below). Results are presented as percent identity to the J99 strain of H. pylori sequenced by
random shotgun cloning. To control for any variations in the J99 sequence, each of the four
open reading frames was cloned and sequenced again from the J99 bacterial strain and that
sequence information was compared to the sequence information that had been collected
from inserts cloned by random shotgun sequencing of the J99 strain.

Table 8

Multiple Strain DNA Sequence analysis of H. pylori Vaccine Candidates

J99 Seq. ID#: | 223 | 223 | 179 | 179 | 199 | 199
Length of Region Sequenced: | 232 a.a. | 696 nt. | 182 a.a. | 548 nt. | 273 a.a. | 819 nt.
Strain Tested | AA identity | Nuc. identity | AA identity | Nuc. identity | AA identity | Nuc. identity
J99 | 100.00% | 100.00% | 100.00% | 100.00% | 99.63% | 99.88%
AH244 | n.d. | n.d. | 99.09% | 96.71% | 98.90% | 96.45%
AH4 | 97.84% | 95.83% | n.d. | n.d. | 97.80% | 95.73%
AH5 | 98.28% | 96.12% | 98.91% | 96.90% | 98.53% | 95.73%
AH15 | 97.41% | 95.98% | 99.82% | 97.99% | 99.63% | 96.09%
AH61 | 97.84% | 95.98% | 99.27% | 97.44% | n.d. | n.d.
5155 | n.d. | n.d. | 99.45% | 97.08% | 98.53% | 95.60%
5294 | 98.28% | 95.40% | 99.64% | 97.26% | 97.07% | 95.48%
7958 | 97.84% | 95.40% | n.d. | n.d. | 99.63% | 96.46%
5640 | 97.41% | 95.69% | 99.09% | 97.63% | 98.53% | 95.48%
AH18 | 98.71% | 95.69% | 99.64% | 97.44% | 100.00% | 95.97%
AH24 | 97.84% | 95.40% | 99.27% | 96.71% | 100.00% | 96.46%


n.d. = not done 
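Percent identity of the kind tabulated above can be computed as sketched below, assuming the two sequences are already aligned without gaps and of equal length. How the reported values were actually derived (alignment program, gap handling) is not stated in this section, and the function names are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions that match between two equal-length, gap-free sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


def identity_from_mismatches(mismatches, length):
    """Percent identity implied by a number of mismatches over a region of given length."""
    return 100.0 * (length - mismatches) / length


if __name__ == "__main__":
    # Five differing residues over the 232 a.a. region.
    print(round(identity_from_mismatches(5, 232), 2))
    # A single differing residue over the 273 a.a. region.
    print(round(identity_from_mismatches(1, 273), 2))
```

Read this way, an amino acid identity of 97.84% over the 232-residue region corresponds to five differing residues, and 99.63% over the 273-residue region to a single difference.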



VI. Experimental Knock-Out Protocol for the Determination of Essential H. pylori Genes
as Potential Therapeutic Targets

Therapeutic targets are chosen from genes whose protein products appear to play
key roles in essential cell pathways such as cell envelope synthesis, DNA synthesis,
transcription, translation, regulation and colonization/virulence.

The protocol for the deletion of portions of H. pylori genes/ORFs and the
insertional mutagenesis of a kanamycin-resistance cassette in order to identify genes which
are essential to the cell is modified from previously published methods (Labigne-Roussel et
al., 1988, J. Bacteriology 170, pp. 1704-1708; Cover et al., 1994, J. Biological Chemistry
269, pp. 10566-10573; Reyrat et al., 1995, Proc. Natl. Acad. Sci. 92, pp. 8768-8772). The
result is a gene "knock-out."

Identification and Cloning of H. pylori Gene Sequences

The sequences of the genes or ORFs (open reading frames) selected as knock-out
targets are identified from the H. pylori genomic sequence and used to design primers to
specifically amplify the genes/ORFs. All synthetic oligonucleotide primers are designed
with the aid of the OLIGO program (National Biosciences, Inc., Plymouth, MN 55447,
USA), and can be purchased from Gibco/BRL Life Technologies (Gaithersburg, MD,
USA). If the ORF is smaller than 800 to 1000 base pairs, flanking primers are chosen
outside of the open reading frame.

Genomic DNA prepared from the Helicobacter pylori HP-J99 strain (ATCC 55679)
is used as the source of template DNA for amplification of the ORFs by PCR (polymerase
chain reaction) (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F.
Ausubel et al., editors, 1994). For the preparation of genomic DNA from H. pylori, see
Example I. PCR amplification is carried out by introducing 10 nanograms of genomic
HP-J99 DNA into a reaction vial containing 10 mM Tris pH 8.3, 50 mM KCl, 2 mM MgCl2, 2
micromolar synthetic oligonucleotide primers (forward=F1 and reverse=R1), 0.2 mM of
each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP), and 1.25 units of heat
stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA)
in a final volume of 40 microliters. The PCR is carried out with Perkin Elmer
Cetus/GeneAmp PCR System 9600 thermal cyclers.

Upon completion of thermal cycling reactions, each sample of amplified DNA is
visualized on a 2% TAE agarose gel stained with Ethidium Bromide (Current Protocols in
Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) to
determine that a single product of the expected size has resulted from the reaction.
Amplified DNA is then washed and purified using the Qiaquick Spin PCR purification kit
(Qiagen, Gaithersburg, MD, USA).

PCR products are cloned into the pT7Blue T-Vector (catalog #69820-1, Novagen,
Inc., Madison, WI, USA) using the TA cloning strategy (Current Protocols in Molecular
Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). The ligation of the
PCR product into the vector is accomplished by mixing a 6 fold molar excess of the PCR
product, 10 ng of pT7Blue-T vector (Novagen), 1 microliter of T4 DNA Ligase Buffer
(New England Biolabs, Beverly, MA, USA), and 200 units of T4 DNA Ligase (New
England Biolabs) into a final reaction volume of 10 microliters. Ligation is allowed to
proceed for 16 hours at 16°C.
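The mass of PCR product that gives a 6 fold molar excess over 10 ng of vector depends on the relative lengths of insert and vector. The sketch below uses the standard mass-to-moles proportionality for double-stranded DNA; the vector length default of 2900 bp is an assumed round figure for illustration, not a value given in the text, and the function name is hypothetical.

```python
def insert_mass_ng(insert_bp, vector_ng=10.0, molar_ratio=6.0, vector_bp=2900):
    """Mass of insert (ng) giving molar_ratio moles of insert per mole of vector.

    For double-stranded DNA, moles are proportional to mass / length, so
    insert_ng = molar_ratio * vector_ng * insert_bp / vector_bp.
    vector_bp=2900 is an assumed approximate vector size (hypothetical default).
    """
    if insert_bp <= 0 or vector_bp <= 0:
        raise ValueError("lengths must be positive")
    return molar_ratio * vector_ng * insert_bp / vector_bp


if __name__ == "__main__":
    for insert_bp in (500, 1000, 1500):
        print(insert_bp, "bp insert:", round(insert_mass_ng(insert_bp), 1), "ng")
```

Because the required mass scales linearly with insert length, shorter PCR products need proportionally less DNA to reach the same molar excess.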

Ligation products are electroporated (Current Protocols in Molecular Biology, John
Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) into electroporation-competent XL-1
Blue or DH5-α E. coli cells (Clontech Lab., Inc., Palo Alto, CA, USA). Briefly, 1 microliter
of ligation reaction is mixed with 40 microliters of electrocompetent cells and subjected to
a high voltage pulse (25 microFarads, 2.5 kV, 200 ohms) after which the samples are
incubated in 0.45 ml SOC medium (0.5% yeast extract, 2% tryptone, 10 mM NaCl, 2.5 mM
KCl, 10 mM MgCl2, 10 mM MgSO4 and 20 mM glucose) at 37°C with shaking for 1 hour.
Samples are then spread onto LB (10 g/l bacto tryptone, 5 g/l bacto yeast extract, 10 g/l
sodium chloride) plates containing 100 micrograms/ml of Ampicillin, 0.3% X-gal, and 100
micrograms/ml IPTG. These plates are incubated overnight at 37°C. Ampicillin-resistant
colonies with white color are selected, grown in 5 ml of liquid LB containing 100
micrograms/ml of Ampicillin, and plasmid DNA is isolated using the Qiagen miniprep
protocol (Qiagen, Gaithersburg, MD, USA).

To verify that the correct H. pylori DNA inserts have been cloned, these pT7Blue
plasmid DNAs are used as templates for PCR amplification of the cloned inserts, using the
same forward and reverse primers used for the initial amplification of the J99 H. pylori
sequence. Recognition of the primers and a PCR product of the correct size, as visualized
on a 2% TAE, ethidium bromide stained agarose gel, are confirmation that the correct
inserts have been cloned. Two to six such verified clones are obtained for each knock-out
target, and frozen at -70°C for storage. To minimize errors due to PCR, plasmid DNA from
these verified clones is pooled, and used in subsequent cloning steps.

The sequences of the genes/ORFs are again used to design a second pair of primers
which flank the region of H. pylori DNA to be either interrupted or deleted (up to 250
basepairs) within the ORFs but are oriented away from each other. The pool of circular
plasmid DNAs of the previously isolated clones is used as template for this round of
PCR. Since the orientation of amplification of this pair of deletion primers is away from
each other, the portion of the ORF between the primers is not included in the resultant PCR
product. The PCR product is a linear piece of DNA with H. pylori DNA at each end and
the pT7Blue vector backbone between them which, in essence, results in the deletion of a
portion of the ORFs. The PCR product is visualized on a 1% TAE, ethidium bromide
stained agarose gel to confirm that only a single product of the correct size has been
amplified.
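The geometry of this outward-facing ("inverse") PCR can be illustrated with a toy model in which the circular plasmid is a string and the region between the primers is simply left out of the product. The letters V (vector), H (H. pylori insert) and D (region to be deleted) are placeholders rather than real sequence, strand orientation is ignored, and the function name is hypothetical.

```python
def inverse_pcr_product(circular_seq, del_start, del_end):
    """Linear product from outward-facing primers flanking circular_seq[del_start:del_end].

    The product begins just after the deleted region, runs around the circle,
    and ends just before the deleted region.
    """
    n = len(circular_seq)
    if not 0 <= del_start < del_end <= n:
        raise ValueError("deleted region must lie within the sequence")
    return circular_seq[del_end:] + circular_seq[:del_start]


if __name__ == "__main__":
    # Toy plasmid: vector backbone (V) carrying an insert (H) with a central target region (D).
    plasmid = "VVVV" + "HHHH" + "DDDD" + "HHHH" + "VVVV"
    product = inverse_pcr_product(plasmid, del_start=8, del_end=12)
    print(product)
    print("product length:", len(product), "of", len(plasmid))
```

In the toy output the insert letters sit at both ends with the vector letters between them, mirroring the description of a linear product with H. pylori DNA at each end and the vector backbone in the middle; the product is shorter than the plasmid by exactly the length of the deleted region.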

A Kanamycin-resistance cassette (Labigne-Roussel et al., 1988, J. Bacteriology 170,
1704-1708) is ligated to this PCR product by the TA cloning method used previously
(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al.,
editors, 1994). The Kanamycin cassette containing a Campylobacter kanamycin resistance
gene is obtained by carrying out an EcoRI digestion of the recombinant plasmid
pCTBS:kan (Cover et al., 1994, J. Biological Chemistry 269, pp. 10566-10573). The proper
fragment (1.4 kb) is resolved on a 1% TAE gel, and isolated using the QIAquick gel
extraction kit (Qiagen, Gaithersburg, MD, USA). The fragment is end repaired using the
Klenow fill-in protocol, which involves mixing 4 µg of the DNA fragment, 1 microliter of
dATP, dGTP, dCTP, dTTP at 0.5 mM, 2 microliters of Klenow Buffer (New England
Biolabs) and 5 units of DNA Polymerase I Large (Klenow) Fragment (New
England Biolabs) into a 20 microliter reaction, incubating at 30°C for 15 min, and
inactivating the enzyme by heating to 75°C for 10 minutes. This blunt-ended Kanamycin
cassette is then purified through a Qiaquick column (Qiagen, Gaithersburg, MD, USA) to
eliminate nucleotides. The "T" overhang is then generated by mixing 5 micrograms of the
blunt-ended kanamycin cassette, 10 mM Tris pH 8.3, 50 mM KCl, 2 mM MgCl2, 5 units of
DNA Polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA), 20
microliters of 5 mM dTTP, in a 100 microliter reaction and incubating the reaction for 2
hours at 37°C. The "Kan-T" cassette is purified using a QIAquick column (Qiagen,
Gaithersburg, MD, USA). The PCR product of the deletion primers (F2 and R2) is ligated
to the Kan-T cassette by mixing 10 to 25 ng of deletion primer PCR product, 50 - 75 ng
Kan-T cassette DNA, 1 microliter 10x T4 DNA Ligase reaction mixture, 0.5 microliter T4
DNA Ligase (New England Biolabs, Beverly, MA, USA) in a 10 microliter reaction and
incubating for 16 hours at 16°C.

The ligation products are transformed into XL-1 Blue or DH5-α E. coli cells by
electroporation as described previously. After recovery in SOC, cells are plated onto LB
plates containing 100 micrograms/ml Ampicillin and grown overnight at 37°C. These plates
are then replica plated onto plates containing 25 micrograms/ml Kanamycin and allowed to
grow overnight. Resultant colonies have both the Ampicillin resistance gene present in the
pT7Blue vector, and the newly introduced Kanamycin resistance gene. Colonies are picked
into LB containing 25 micrograms/ml Kanamycin and plasmid DNA is isolated from the
cultured cells using the Qiagen miniprep protocol (Qiagen, Gaithersburg, MD, USA).

Several tests by PCR amplification are conducted on these plasmids to verify that
the Kanamycin cassette is inserted in the H. pylori gene/ORF, and to determine the
orientation of the insertion of the Kanamycin-resistance gene relative to the H. pylori
gene/ORF. To verify that the Kanamycin cassette is inserted into the H. pylori sequence,
the plasmid DNAs are used as templates for PCR amplification with the set of primers
originally used to clone the H. pylori gene/ORFs. The correct PCR product is the size of
the deleted gene/ORF but increased in size by the addition of a 1.4 kilobase Kanamycin
cassette. To avoid potential polar effects of the kanamycin resistance cassette on H. pylori
gene expression, the orientation of the Kanamycin resistance gene with respect to the
knock-out gene/ORF is determined and both orientations are eventually used in H. pylori
transformations (see below). To determine the orientation of insertion of the kanamycin
resistance gene, primers are designed from the ends of the kanamycin resistance gene
("Kan-1" 5'-ATCTTACCTATCACCTCAAAT-3' (SEQ ID NO:267), and "Kan-2"
5'-AGACAGCAACATCTTTGTGAA-3' (SEQ ID NO:268)). By using each of the cloning
primers in conjunction with each of the Kan primers (4 combinations of primers), the
orientation of the Kanamycin cassette relative to the H. pylori sequence is determined.
Positive clones are classified as either in the "A" orientation (the same direction of
transcription is present for both the H. pylori gene and the Kanamycin resistance gene), or
in the "B" orientation (the direction of transcription for the H. pylori gene is opposite to that
of the Kanamycin resistance gene). Clones which share the same orientation (A or B) are
pooled for subsequent experiments and independently transformed into H. pylori.
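
The four primer combinations used for the orientation test can be enumerated as in the following minimal sketch. The Kan primer sequences are those given in the text; the labels "cloning_forward" and "cloning_reverse" are hypothetical placeholders for the gene-specific cloning primers, whose sequences differ for each gene/ORF and are not assumed here.

```python
from itertools import product

# Hypothetical labels for the two gene-specific cloning primers.
CLONING_PRIMERS = ("cloning_forward", "cloning_reverse")

# Kan primers as given in the text (SEQ ID NO:267 and SEQ ID NO:268).
KAN_PRIMERS = {
    "Kan-1": "ATCTTACCTATCACCTCAAAT",
    "Kan-2": "AGACAGCAACATCTTTGTGAA",
}


def orientation_pcr_pairs():
    """Return every (cloning primer, Kan primer) pair to be run as a PCR."""
    return list(product(CLONING_PRIMERS, KAN_PRIMERS))


pairs = orientation_pcr_pairs()
for cloning, kan in pairs:
    print(f"{cloning} + {kan} ({KAN_PRIMERS[kan]})")
```

Which of the four reactions yields a product then distinguishes the "A" orientation from the "B" orientation.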

Transformation of Plasmid DNA into H. pylori cells

Two strains of H. pylori are used for transformation: HP-J99 (ATCC 55679), the
clinical isolate which provided the DNA from which the H. pylori sequence database is
obtained, and AH244, an isolate which has been passaged in, and has the ability to
colonize, the mouse stomach. Cells for transformation are grown at 37°C, 10% CO2, 100%
humidity, either on Sheep-Blood agar plates or in Brucella Broth liquid. Cells are grown to
exponential phase, and examined microscopically to determine that the cells are "healthy"
(actively moving cells) and not contaminated. If grown on plates, cells are harvested by
scraping cells from the plate with a sterile loop, suspended in 1 ml of Brucella Broth, spun
down (1 minute, top speed in eppendorf microfuge) and resuspended in 200 microliters
Brucella Broth. If grown in Brucella Broth liquid, cells are centrifuged (15 minutes at 3000
rpm in a Beckman TJ6 centrifuge) and the cell pellet resuspended in 200 microliters of
Brucella Broth. An aliquot of cells is taken to determine the optical density at 600 nm, in
order to calculate the concentration of cells. An aliquot (1 to 5 OD600 units/25 microliter)
of the resuspended cells is placed onto a prewarmed Sheep-Blood agar plate, and the plate
is further incubated at 37°C, 6% CO2, 100% humidity for 4 hours. After this incubation,
10 microliters of plasmid DNA (100 micrograms per microliter) is spotted onto these cells.
A positive control (plasmid DNA with the ribonuclease H gene disrupted by the kanamycin
resistance gene) and a negative control (no plasmid DNA) are done in parallel. The plates
are returned to 37°C, 6% CO2 for an additional 4 hours of incubation. Cells are then
spread onto that plate using a swab wetted in Brucella Broth, and grown for 20 hours at
37°C, 6% CO2. Cells are then transferred to a Sheep-Blood agar plate containing 25
micrograms/ml Kanamycin, and allowed to grow for 3 to 5 days at 37°C, 6% CO2, 100%
humidity. If colonies appear, they are picked and regrown as patches on a fresh
Sheep-Blood agar plate containing 25 micrograms/ml Kanamycin.

Three sets of PCR tests are done to verify that the colonies of transformants have
arisen from homologous recombination at the proper chromosomal location. The template
for PCR (DNA from the colony) is obtained by a rapid boiling DNA preparation method as
follows. An aliquot of the colony (stab of the colony with a toothpick) is introduced into
100 microliters of 1% Triton X-100, 20 mM Tris, pH 8.5, and boiled for 6 minutes. An
equal volume of phenol:chloroform (1:1) is added and vortexed. The mixture is
microfuged for 5 minutes and the supernatant is used as DNA template for PCR with
combinations of the following primers to verify homologous recombination at the proper
chromosomal location.

TEST 1. PCR with cloning primers originally used to amplify the gene/ORF. A
positive result of homologous recombination at the correct chromosomal location should
show a single PCR product whose size is expected to be the size of the deleted gene/ORF
but increased in size by the addition of a 1.4 kilobase Kanamycin cassette. A PCR product
of just the size of the gene/ORF is proof that the gene had not been knocked out and that
the transformant is not the result of homologous recombination at the correct chromosomal
location.

TEST 2. PCR with F3 (primer designed from sequences upstream of the gene/ORF
and not present on the plasmid), and either primer Kan-1 or Kan-2 (primers designed from
the ends of the kanamycin resistance gene), depending on whether the plasmid DNA used
was of "A" or "B" orientation. Homologous recombination at the correct chromosomal
location will result in a single PCR product of the expected size (i.e., from the location of
F3 to the insertion site of the kanamycin resistance gene). No PCR product or PCR product(s)
of incorrect size(s) will prove that the plasmid had not integrated at the correct site and that
the gene had not been knocked out.

TEST 3. PCR with R3 (primer designed from sequences downstream of the
gene/ORF and not present on the plasmid) and either primer Kan-1 or Kan-2, depending on
whether the plasmid DNA used was of "A" or "B" orientation. Homologous recombination
at the correct chromosomal location will result in a single PCR product of the expected size
(i.e., from the insertion site of the kanamycin resistance gene to the downstream location of
R3). Again, no PCR product or PCR product(s) of incorrect size(s) will prove that the
plasmid had not integrated at the correct site and that the gene had not been knocked out.

Transformants showing positive results for all three tests above indicate that the
gene is not essential for survival in vitro.

A negative result in any of the three above tests for each transformant indicates that
the gene had not been disrupted, and that the gene is essential for survival in vitro.
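
The interpretation of TEST 1 to TEST 3 can be summarized in a short sketch. This is an illustration only, assuming each test has already been scored as a simple pass/fail; the function names and the representation of the 1.4 kilobase cassette as 1400 base pairs are hypothetical conveniences, not part of the protocol.

```python
# Approximate size of the Kanamycin cassette (1.4 kilobases, as stated in the text).
KAN_CASSETTE_BP = 1400


def expected_test1_product_bp(deleted_orf_bp: int) -> int:
    """Expected TEST 1 product: the deleted gene/ORF plus the Kanamycin cassette."""
    return deleted_orf_bp + KAN_CASSETTE_BP


def classify_transformant(test1_ok: bool, test2_ok: bool, test3_ok: bool) -> str:
    """Apply the decision rule described in the text to one transformant."""
    if test1_ok and test2_ok and test3_ok:
        return "disrupted: gene not essential for survival in vitro"
    return "not disrupted: gene treated as essential for survival in vitro"


print(expected_test1_product_bp(600))
print(classify_transformant(True, True, True))
print(classify_transformant(True, False, True))
```

A single failed test is enough to place the transformant in the second category, mirroring the "any of the three" wording above.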

In the event that no colonies result from two independent transformations while the
positive control with the disrupted ribonuclease H plasmid DNA produces transformants,
the plasmid DNA is further analyzed by PCR on DNA from transformant populations prior
to plating for colony formation. This will verify that the plasmid can enter the cells and
undergo homologous recombination at the correct site. Briefly, plasmid DNA is incubated
according to the transformation protocol described above, DNA is extracted from the H.
pylori cells immediately after incubation with the plasmid DNAs, and the DNA is used as
template for the above TEST 2 and TEST 3. Positive results in TEST 2 and TEST 3 would
verify that the plasmid DNA could enter the cells and undergo homologous recombination
at the correct chromosomal location. If TEST 2 and TEST 3 are positive, then failure to
obtain viable transformants indicates that the gene is essential, and cells suffering a
disruption in that gene are incapable of colony formation.

VII. High-throughput drug screen assay 
Cloning, expression and protein purification 

Cloning, transformation, expression and purification of the H. pylori target gene and
its protein product, e.g., an H. pylori enzyme, to be used in a high-throughput drug screen
assay, is carried out essentially as described in Examples II and III above. Development
and application of a screening assay for a particular H. pylori gene product, peptidyl-prolyl
cis-trans isomerase, is described below as a specific example.

Enzymatic Assay 

The assay is essentially as described by Fischer (Fischer, G., et al. (1984) Biomed.
Biochim. Acta 43: 1101-1111). The assay measures the cis-trans isomerization of the
Ala-Pro bond in the test peptide N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide (Sigma # S-7388,
lot # 84H5805). The assay is coupled with α-chymotrypsin, which cleaves the test peptide
only when the Ala-Pro bond is in trans. The conversion of the test peptide to the trans
isomer in the assay is followed at 390 nm on a Beckman Model DU-650
spectrophotometer. The data are collected every second with an average scanning time of
0.5 second. Assays are carried out in 35 mM Hepes, pH 8.0, in a final volume of 400
microliters, with 10 µM α-chymotrypsin (type I-S from bovine pancreas, Sigma # C-7762,
lot 23H7020) and 10 nM PPIase. To initiate the reaction, 10 microliters of the substrate
(2 mM N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide in DMSO) is added to 390 microliters
of reaction mixture at room temperature.
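
As a worked check of the volumes stated above, the following sketch computes the final substrate concentration and DMSO fraction after the substrate is added. It assumes simple additive volumes; the function name is hypothetical.

```python
def final_concentration(stock_conc: float, stock_volume: float, final_volume: float) -> float:
    """Concentration after dilution, assuming volumes are additive (C1*V1 = C2*V2)."""
    return stock_conc * stock_volume / final_volume


SUBSTRATE_STOCK_UM = 2000.0   # 2 mM substrate stock in DMSO, expressed in micromolar
SUBSTRATE_VOLUME_UL = 10.0    # volume of substrate added
REACTION_MIX_UL = 390.0       # volume of reaction mixture
FINAL_VOLUME_UL = SUBSTRATE_VOLUME_UL + REACTION_MIX_UL  # 400 microliters

substrate_uM = final_concentration(SUBSTRATE_STOCK_UM, SUBSTRATE_VOLUME_UL, FINAL_VOLUME_UL)
dmso_percent = 100.0 * SUBSTRATE_VOLUME_UL / FINAL_VOLUME_UL

print(f"final substrate: {substrate_uM:.1f} micromolar")
print(f"final DMSO: {dmso_percent:.1f}% (v/v)")
```

Under these assumptions the substrate is present at 50 micromolar in 2.5% (v/v) DMSO at the start of the reaction.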

Enzymatic assay in crude bacterial extract.

A 50 ml culture of Helicobacter pylori (strain J99) in Brucella broth is harvested at
mid-log phase (OD600 nm ~ 1) and resuspended in lysis buffer with the following protease
inhibitors: 1 mM PMSF, and 10 ng/ml of each of aprotinin, leupeptin, pepstatin, TLCK,
TPCK, and soybean trypsin inhibitor. The suspension is subjected to 3 cycles of
freeze-thaw (15 minutes at -70°C, then 30 minutes at room temperature), followed by
sonication (three 20 second bursts). The lysate is centrifuged (12,000 g x 30 minutes) and
the supernatant is assayed for enzymatic activity as described above.

Many H. pylori enzymes can be expressed at high levels and in an active form in E.
coli. Such high yields of purified proteins provide for the design of various high
throughput drug screening assays.

Other Embodiments 

Included in the invention are: allelic variations; natural mutants; induced mutants;
proteins encoded by DNA that hybridizes under high or low stringency conditions to a
nucleic acid which encodes a polypeptide as shown in SEQ ID NOs: 1-114 (for definitions
of high and low stringency see Current Protocols in Molecular Biology, John Wiley &
Sons, New York, 1989, 6.3.1 - 6.3.6, hereby incorporated by reference); and, polypeptides
specifically bound by antisera to H. pylori polypeptides, especially by antisera to an active
site or binding domain of H. pylori polypeptide.

The invention also includes fragments, preferably biologically active fragments, or
analogs of H. pylori polypeptides. A biologically active fragment or analog is one having
any in vivo or in vitro activity which is characteristic of the H. pylori polypeptides shown in
SEQ ID NOs: 115-228, or of other naturally occurring H. pylori polypeptides, e.g., one or
more of the biological activities described above. Especially preferred are fragments which
exist in vivo, e.g., fragments which arise from post transcriptional processing or which arise
from translation of alternatively spliced RNA's. Fragments include those expressed in
native or endogenous cells as well as those made in expression systems, e.g., in CHO cells.
Because peptides such as H. pylori polypeptides often exhibit a range of physiological
properties and because such properties may be attributable to different portions of the
molecule, a useful H. pylori fragment or H. pylori analog is one which exhibits a biological
activity in any biological assay for H. pylori activity. Most preferably the fragment or
analog possesses 10%, preferably 40%, or at least 90% of the activity of H. pylori, in any in
vivo or in vitro assay.

Analogs can differ from naturally occurring H. pylori polypeptides in amino acid 
sequence or in ways that do not involve sequence, or both. Non-sequence modifications 
include changes in acetylation, methylation, phosphorylation, carboxylation, or 
glycosylation. 

Preferred analogs include H. pylori polypeptides (or biologically active fragments
thereof) whose sequences differ from the wild-type sequence by one or more conservative
amino acid substitutions or by one or more non-conservative amino acid substitutions,
deletions, or insertions which do not abolish the biological activity of the H. pylori
polypeptide. Conservative substitutions typically include the substitution of one amino
acid for another with similar characteristics, e.g., substitutions within the following groups:
valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid;
asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine.
Other conservative substitutions can be taken from the table below.

TABLE 9

CONSERVATIVE AMINO ACID REPLACEMENTS

For Amino Acid   Code   Replace with any of

Alanine          A      D-Ala, Gly, beta-Ala, L-Cys, D-Cys
Arginine         R      D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, Ile, D-Met,
                        D-Ile, Orn, D-Orn
Asparagine       N      D-Asn, Asp, D-Asp, Glu, D-Glu, Gln, D-Gln
Aspartic Acid    D      D-Asp, D-Asn, Asn, Glu, D-Glu, Gln, D-Gln
Cysteine         C      D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr
Glutamine        Q      D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp
Glutamic Acid    E      D-Glu, D-Asp, Asp, Asn, D-Asn, Gln, D-Gln
Glycine          G      Ala, D-Ala, Pro, D-Pro, beta-Ala, Acp
Isoleucine       I      D-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met
Leucine          L      D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met
Lysine           K      D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met,
                        Ile, D-Ile, Orn, D-Orn
Methionine       M      D-Met, S-Me-Cys, Ile, D-Ile, Leu, D-Leu, Val, D-Val
Phenylalanine    F      D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp,
                        Trans-3,4, or 5-phenylproline, cis-3,4, or 5-phenylproline
Proline          P      D-Pro, L-1-thioazolidine-4-carboxylic acid, D- or L-1-
                        oxazolidine-4-carboxylic acid
Serine           S      D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O),
                        D-Met(O), L-Cys, D-Cys
Threonine        T      D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O),
                        D-Met(O), Val, D-Val
Tyrosine         Y      D-Tyr, Phe, D-Phe, L-Dopa, His, D-His
Valine           V      D-Val, Leu, D-Leu, Ile, D-Ile, Met, D-Met
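
The substitution groups named in the text preceding Table 9 can be expressed as a simple lookup, as in the sketch below. It covers only those eight groups, written with one-letter codes, and not the D-amino acid or non-standard replacements of Table 9; the function name is hypothetical. Membership is checked per group, so the relation is deliberately not transitive (valine/glycine and glycine/alanine are listed, but valine/alanine is not).

```python
# The eight groups listed in the text, as one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    {"V", "G"},        # valine, glycine
    {"G", "A"},        # glycine, alanine
    {"V", "I", "L"},   # valine, isoleucine, leucine
    {"D", "E"},        # aspartic acid, glutamic acid
    {"N", "Q"},        # asparagine, glutamine
    {"S", "T"},        # serine, threonine
    {"K", "R"},        # lysine, arginine
    {"F", "Y"},        # phenylalanine, tyrosine
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues differ and fall within one of the listed groups."""
    a, b = original.upper(), replacement.upper()
    return a != b and any(a in group and b in group for group in CONSERVATIVE_GROUPS)


print(is_conservative("K", "R"))
print(is_conservative("V", "A"))
```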



Other analogs within the invention are those with modifications which increase
peptide stability; such analogs may contain, for example, one or more non-peptide bonds
(which replace the peptide bonds) in the peptide sequence. Also included are: analogs that
include residues other than naturally occurring L-amino acids, e.g., D-amino acids or
non-naturally occurring or synthetic amino acids, e.g., β or γ amino acids; and cyclic analogs.

As used herein, the term "fragment", as applied to an H. pylori analog, will ordinarily
be at least about 20 residues, more typically at least about 40 residues, preferably at least
about 60 residues in length. Fragments of H. pylori polypeptides can be generated by
methods known to those skilled in the art. The ability of a candidate fragment to exhibit a
biological activity of H. pylori polypeptide can be assessed by methods known to those
skilled in the art as described herein. Also included are H. pylori polypeptides containing
residues that are not required for biological activity of the peptide or that result from
alternative mRNA splicing or alternative protein processing events.

In order to obtain an H. pylori polypeptide, H. pylori polypeptide-encoding DNA
can be introduced into an expression vector, the vector introduced into a cell suitable for
expression of the desired protein, and the peptide recovered and purified, by prior art
methods. Antibodies to the peptides and proteins can be made by immunizing an animal,
e.g., a rabbit or mouse, and recovering anti-H. pylori polypeptide antibodies by prior art
methods.

The nucleic acids and corresponding polypeptides of the invention were disclosed
previously in the corresponding US application, U.S. S.N. 08/561,469, filed November 17,
1995 (Attorney Docket No.: GTN-001CP). The correlation between sequence
identification numbers in the above-identified parent applications and sequence
identification numbers provided herein is outlined in Table 10 below.

TABLE 10



Parent      Relation-   New         Parent      Parent      Relation-   New
Nucleic     ship        Nucleic     Amino       amino acid  ship        Amino
Acid SEQ                Acid SEQ    Acid SEQ    HPP #                   Acid SEQ
ID NO:                  ID NO:      ID NO.                              ID NO.

881         a           1           385         [illegible] a           115
882         a           2           390         9           a           116
883         a           3           401         20          a           117
884         a           4           407         26          a           118
885         a           5           409         28          a           119
886         a           6           410         29          a           120
887         a           7           413         34          a           121
888         a           8           431         [illegible] a           122
889         a           9           435         60          a           123
890         a           10          442         68          a           124
891         a           11          445         71          a           125
892         a           12          449         75          a           126
893         a           13          463         89          a           127
894         a           14          464         90          a           128
895         a           15          467         94          a           129
896         a           16          470         97          a           130
897         a           17          474         101         a           131
898         a           18          476         103         a           132
899         a           19          477         104         a           133
900         a           20          480         107         a           134
901         a           21          485         114         a           135
902         a           22          487         116         a           136
903         a           23          502         133         a           137
904         a           24          507         139         a           138
905         a           25          508         140         a           139
906         a           26          511         144         a           140
907         a           27          515         148         a           141
908         a           28          517         150         a           142
909         a           29          521         154         a           143
910         a           30          526         161         a           144
911         a           31          534         170         a           145
912         a           32          538         175         a           146
913         a           33          541         178         a           147
914         a           34          545         183         a           148
915         a           35          549         187         a           149
916         a           36          551         189         a           150
917         a           37          [illegible] [illegible] a           151
918         a           38          557         195         a           152
919         a           39          559         197         a           153
920         a           40          561         199         a           154
921         a           41          569         209         a           155
922         a           42          571         211         a           156
923         a           43          580         220         a           157
924         a           44          584         224         a           158
925         a           45          594         234         a           159
926         a           46          595         235         a           160
927         a           47          615         256         a           161
928         a           48          616         257         a           162
929         a           49          624         265         a           163
930         a           50          626         267         a           164
931         a           51          628         269         a           165
932         a           52          632         273         a           166
933         a           53          634         275         a           167
934         a           54          637         279         a           168
935         a           55          641         283         a           169
936         a           56          644         287         a           170
937         a           57          645         288         a           171
938         a           58          646         289         a           172
939         a           59          648         291         a           173
940         a           60          652         296         a           174
941         a           61          662         307         a           175
942         a           62          671         316         a           176
943         a           63          672         317         a           177
944         a           64          675         320         a           178
945         a           65          677         322         a           179
946         a           66          684         331         a           180
947         a           67          685         332         a           181
948         a           68          686         333         a           182
949         a           69          693         [illegible] a           183
950         a           70          703         356         a           184
951         a           71          704         357         a           185
952         a           72          709         363         a           186
953         a           73          710         364         a           187
954         a           74          711         366         a           188
955         a           75          715         [illegible] a           189
956         a           76          723         380         a           190
957         a           77          724         381         a           191
958         a           78          731         [illegible] a           192
959         a           79          734         391         a           193
960         a           80          745         406         a           194
961         a           81          753         415         a           195
962         a           82          754         416         a           196
963         a           83          758         420         a           197
964         a           84          760         422         a           198
965         a           85          764         426         a           199
966         a           86          765         427         a           200
967         a           87          766         428         a           201
968         a           88          774         437         a           202
969         a           89          776         439         a           203
970         a           90          778         441         a           204
971         a           91          785         448         a           205
972         a           92          788         452         a           206
973         a           93          793         457         a           207
974         a           94          795         459         a           208
975         a           95          797         461         a           209
976         a           96          806         471         a           210
977         a           97          812         477         a           211
978         a           98          820         486         a           212
979         a           99          823         489         a           213
980         a           100         827         493         a           214
981         a           101         833         499         a           215
982         a           102         834         500         a           216
983         a           103         842         509         a           217
984         a           104         [illegible] 521         a           218
985         a           105         854         523         a           219
986         a           106         858         529         a           220
987         a           107         861         536         a           221
988         a           108         864         539         a           222
989         a           109         865         540         a           223
990         a           110         867         542         a           224
991         a           111         872         548         a           225
992         a           112         877         553         a           226
993         a           113         878         554         a           227
994         a           114         880         556         a           228



a = sequences from USSN 08/561,469, filed November 17, 1995 (Attorney Docket No.:
GTN-001CP).

EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than
routine experimentation, many equivalents to the specific embodiments and methods
described herein. Such equivalents are intended to be encompassed by the scope of the
following claims.

Other embodiments are within the following claims.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT:
(A) NAME: Astra Aktiebolag
(B) STREET: S-151 85
(C) CITY: Sodertalje
(D) STATE:
(E) COUNTRY: Sweden
(F) POSTAL CODE (ZIP):

(ii) TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES
RELATING TO HELICOBACTER PYLORI FOR
DIAGNOSTICS AND THERAPEUTICS

(iii) NUMBER OF SEQUENCES: 268

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: CD/ROM ISO9660
(B) COMPUTER:
(C) OPERATING SYSTEM:
(D) SOFTWARE:

(v) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:

(vi) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 08/561,469
(B) FILING DATE: 17-NOV-1995

(vii) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: LAHIVE & COCKFIELD
(B) STREET: 60 State Street, Suite 510
(C) CITY: Boston
(D) STATE: Massachusetts
(E) COUNTRY: USA
(F) ZIP: 02109-1875

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Mandragouras, Amy E.
(B) REGISTRATION NUMBER: 36,207
(C) REFERENCE/DOCKET NUMBER: GTN-001CPPC

(x) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (617)227-7400
(B) TELEFAX: (617)227-5941

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 519 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...519

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1

ATGAAAGGYC CYATCCTATG GCCGGCGTTT TCTCAATTTA GCGATCAAGA TTTGAGCGAT 60
ATTGTGGCGT ATCTCACTTC TATTTTGCCT AAAAATTTGA GCGATAAGGA AGTGTTCGCG 120
CAAAGTTGTC AAAGGTGCCA TAGCCTGGAT TATGCTAAAG ATAAGGCCTT TAGCGATCCT 180
AAAGATTTAG CCAATTATTT AGGCTCTCAT GCGCCTGATT TGTCCATGAT GATTAGGGCT 240
AAGGGCGAAC ATGGCTTGAA TGTTTTCATC AACGATCCGC AAAAGCTTTT GCCTGGCACA 300
GCCATGCCTA GAGTGGGATT GAATGAAAAA GCTCAAAAAC AAGTCATTTC TTATTTGGAA 360
AAAGCGGGCG ATAGGAAAAA GCATGAAAGG AATACTTTAG GGATTAAGAT CATGATTTTC 420
TTTGCGGTGC TGTCGTTCTT GGCTTACGCT GGAAAAGAAA AGTTTGGAGC GAAGTGCATT 480
AAATTTAAAA AAGGGGGGAC ATGGTTTTAT GATTTTTAA 519

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 186 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...186

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2

ATGCAAGAAT TCAGTTTGTG GTGCGATTTT ATAGAAAGGG ATTTTTTAGA AAACGATTTT 60
TTAAAGCTCA TCAATAAGGG GGCTATTTGC GGGRCGACGA GTAACCCTAG TTTGTTTTGC 120
GAAGCGATCA CAAAAAGCGC GTTTTATCAA GATGAAATCG CTAAAMCTCA AAGGCAAAAA 180
AGCTAA 186
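
A listing entry such as SEQ ID NO: 2 can be checked against its stated length, and its ambiguous positions located, with a short script. The sketch below is illustrative only: it assumes the standard IUPAC nucleotide ambiguity codes, and the helper names are hypothetical.

```python
# SEQ ID NO: 2 as printed in the listing (groups of ten bases per block).
SEQ_ID_NO_2 = """
ATGCAAGAAT TCAGTTTGTG GTGCGATTTT ATAGAAAGGG ATTTTTTAGA AAACGATTTT
TTAAAGCTCA TCAATAAGGG GGCTATTTGC GGGRCGACGA GTAACCCTAG TTTGTTTTGC
GAAGCGATCA CAAAAAGCGC GTTTTATCAA GATGAAATCG CTAAAMCTCA AAGGCAAAAA
AGCTAA
"""

# IUPAC codes that stand for more than one possible base.
IUPAC_AMBIGUITY = set("RYSWKMBDHVN")


def clean(raw: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def ambiguous_positions(seq: str) -> dict:
    """Map 1-based positions to the ambiguity code found there."""
    return {i + 1: base for i, base in enumerate(seq) if base in IUPAC_AMBIGUITY}


sequence = clean(SEQ_ID_NO_2)
print(len(sequence))
print(ambiguous_positions(sequence))
```

The same check applies to any other entry once its printed blocks are pasted in place of the string above.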

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 861 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...861

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3

TTGAASCCGA TGAAAGTGAT TCAAGTTTTT TTATTTTCCA ACCCTTTTTG CGCGATTGTG 60
CCTAACACGG AGCCAGAACA TTT[remainder of line illegible] 120
GCT^ATAAAT ATTTTTTAGA CCATGCTCAA AAAAGAGTGA TCTATAAAGA AGATCCTTTT 180
TTAAAAAACT ATTCTAAAGA CGCCATTGTT TTAGAAAAAA AAGACATTTA TAATATCCAA 240
TACATTTTAA AAGACGGAGA GCCTTACACT TCGTTTGAAT TGAAAAATTT GGGGGCTTTT 300
TTGGTGTGGG GGTTAGGCGA ACATAACGCC ACGAATGCGA GTTTGGCGAT TTTAAGCGCT 360
TTAGATGAAT TAAATTTAGA AGAAATTAGA AATAATTYAT TGAATTTTAA AGGCATTAAA 420
AAACGCTTTG ATATTTTGCA AAAAAACAAT CTCATTCTCA TTGATGATTA CGCCCACCAC 480
CCTACTGAAA TTGGCRCCAC TTTAAAAAGC GCTAGGATTT ATGCCAATTT ATTGAATACG 540
CAAGAAAAAA TTATAGTGAT CTGGCAAGCG CACAAATACT CTCGCTTAAT GGACAATTTA 600
GAAGAATTTA AAAAATGTTT TTTAGAGCAT TGCGACAGGT TGATCATTTT ACCCGTTTAT 660
AGCGCGAGTG AAGTTAAAAG AGACATTGAT TTGAAAGCCC ATTTTAAGCA TTATAACCCC 720
ACCTTTATAG ACAGGGTGCG TAAAAAGGGG GATTTTTTAG AGCTGTTAGT CAATGATAAT 780
GTGGTAGAAA CGATTGAAAA AGGCTTTGTG ATAGGCTTTG GAGCGGGGGA TATTACCTAT 840
CAGTTGAGAG GCGAAATGTA A 861

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 186 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...186

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4

TTGTTGCTTT TCTTTCTTTT GAARGGCGTC GTTTTTTCTT TGGGCTTTTT TTCCTTTTTT 60
GAAGAAGTCT CTGGCTCTTT TGRAGCTGTT TCTTTGARCG TGTTAGCGTT AGTCATGGGG 120
TCTAGTYCTG GGTTAGAAGA ATTCTGTGTC TTAGAAGAGC TTATAAATTC AGGGCTATCA 180
GTATAG 186

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 369 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...369

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5

ATGGGTTTTT TAAAAGTTTT AAAACATGAC GCTTTAGGGC AAGTAGGGAA TATTGTTATA 60
GGGAATTTCT TAATAACGCT CACTGTTTTA GCGGTTTGTT TTTCCTCTCA AAGCGCTGAA 120
GAAACGACCA TGCTCACCCT AAGCTACACG CTCTTTTTTA TTTTAGGGGC GTTTTTATTA 180
GTCGCAATCA GCGTGGGAGC GATCAAAAAC CTCAACGCGC TTTTTTCTAA AACAGGGCTT 240
TTAAGCTTTT CCTTACCCAT TAGTTTGGAA TCTTTATTGC TCCCTAAAAT CTTGCTCCCC 300
AKGGTGTTTT TTTATCTTCA GTTTGTTCTG GTTTGTGGCG AGCGTGCGTT TGGGCTATTA 360
CCTTTTTAA 369

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 564 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...564

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6

ATGTTAAAAA CCCACTTAAG CAGCGCTAGG GGCGTTGTGG TGTTGTCTAA GATTTTACCG 60
GTGAACGTGG TGTTAATGGT GAGCGTGCGC TTGTTTGAAA AGGAATTAAA ACGCAAACCT 120
TACTACATCA TTGCGAGCGC ACACAGCGAT GAAGGTTTAG AAAAATTAAA AAAATTWGGG 180
GYTGATATGG TGGKTTYCCC TACAAAACTY ATGGCGCAGA GAGTGAGCGC GAATKGMTGG 240
TGYKTCCTGG ATATGGAAAA TATCTTAGAG CGTTTTATCA ACAAAAAAGA CACGCTTTTA 300
GACTTAGAGG AAGTGATTGT CCCCAAAACC AGCTGGCTTG TGTTAAGGAA ATTAAAAGAA 360
GCCCATTTTA GAGAGATCGC TAAAGCCTTT GTGATTGGTA TCACTCAAAA AGATGGCAAA 420
TACATCCCCA TGCCTGACGG GGAAACGATT ATTGCAAGCG AATCCAAGCT ATTGATGGTT 480
GGCACTTCAG AAGGCGTTGC GACCTGTAAG CAACTCATTA CTAGCCACCA AAAACCAAAA 540
GAAGTGGATT ACATTTCATT GTGA 564

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 582 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...582

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7

GTGGGGAGCT TTTTATTCGT GGGGCCTAGT GGGGTAGGGA AAACAGAATT GGCTAAAGAA 60
TTGGCCTTGA ATTTGMATTT GCATTTTGAA CGCTTTGACA TGAGCGAATA CAAAGAAGCC 120
CATAGCGTGG CAAAGCTCAT CGGAAGTCCT AGCGGTTATG TGGGGTTTGA GCAAGGGGGG 180
TTATTGGTGA ATGCGATTAA AAAGCACCCG CATTGTTTGC TGCTTTTAGA TGAGATAGAA 240
AAGGCCCACC CTAATGTGTA TGATTTGTTG TTGCAGGTGA TGGAKAACGC CACTTTGAGC 300
GATAATTTAG GCAACAAGGC GAGTTTTAAG CATGTGATAC TGATTATGAC KKCAARTGTG 360
GGGAGTAAGG ATAAGGACAC GCTAGGGTTT TTTAGCACTA AAAACGCCAA GTATGATAGA 420
[line illegible] 480
AACGCGCTCA GTTTGGAGGA TTTTGAAACG CATTGTTTCT GTGGAATTGG ACGGGTTAAA 540
AGCCCTAGCA CTAGAGCAAG GCGTGATCTT AAAATTCCAT AA 582



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 909 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...909

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8



ATGGCTTTTC AGGTCAATAC AAATATCAAT GCGATGAATG CGCATGTGCA ATCCGCACTC 60
ACTCAAAACG CACTTAAAAC TTCATTGGAG CGATTGAGTT CAGGTTTAAG GATCAATAAA 120
GCGGCTGATG ACGCATCAGG CATGACGGTG GCGGATTCTT TGCGTTCGCA AGCGAGCAGT 180
TTGGGTCAAG CGATTGCCAA CACGAATGAC GGCATGGGGA TTATCCAGGT TGCGGATAAG 240
GCTATGGATG AGCAATTAAA AATCTTAGAC ACCGTTAAGG TTAAAGCGAC TCAAGCGGCT 300
CAAGATGGGC AAACTACGGA ATCTCGTAAA GCGATTCAAT CTGACATCGT TCGTTTGATT 360
CAAGGTTTGG ATAATATCGG TAACACAACG ACTTATAACG GGCAAGCGTT ATTGTCTGGT 420
CAATTCACTA ACAAAGAATT CCAAGTAGGG GCTTATTCTA ACCAAAGCAT TAAGGCTTCT 480
ATCGGCTCTA CCACTTCCGA TAAAATCGGT CAGGTTCGTA TCGCTACAGG CGCGTTAATC 540
ACGGCTTCTG GGGATATTAG CTTGACTTTT AAACAAGTGG ATGGCGTGAA TGATGTAACT 600
TTAGAGAGCG TAAAAGTTTC TAGTTCAGCA GGCACAGGGA TCGGCGTGTT AGCAGAAGTG 660
ATCAATAAAA ACTCTAACCG AACAGGGGTT AAAGCTTATG CGAGCGTTAT CACCACGAGC 720
GATGTGGCGG TCCAGTCAGG AAGTTTGAGT AATTTAACCT TAAATGGGAT TCATTTGGGT 780
AATATCGCAG ATATTAAGMR AAACGACTCA GACGGAAGGT TAGTCACAGC RATCAATGCG 840
GTCACTTCAG AAACCGGTGT GGWAGCTTAT ACGGATCAAA AAGGACGCTT GAATTTGCGC 900
AGTATAGGT 909



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 486 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...486

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9



ATGTTTTTTA AAACTTATCA AAAATTACTG GGCGCGAGCT GTTTGGCGCT GTATTTAGTG 60

GGCTGTGGGA ATGGTGGTGG CGGTGAATCG CCGGTTGAGA TGATTRCAAA TAGCGAGGGT 120

ACGT^TOAAA TOOAC rr "OCAA AOCAOAT/- 0*"" A mm AC m ^ * a Qr^ r,rTi ' ri ^^j* ^ c^t^t* a ^ m a ^ ^ *i c a 

GGTAATTGTG CTGTCAATTT TGTTCCAGTA AGTGAGACGT TTCAAATGGG TGTTTTAAGT 240 

CAAGTTACTC CAATCTCTAT ACAGGATTTT AAAGATATGG CAAGCACTTA TAAGATATTT 300

GATCAAAAGA AAGGGTTGGC AAACATAGCA AATAAAATTT CTCAATTAGA GCAAAAGGGT 360

GTGATGATGA AACCTCMAAC CCTTAATTTT GGAGAAAGTT TAAAAGGCAT TTCTCAAGGG 420

TGCAATATTA TAGAGGCAGA AATACAAACC GACAAAGGCG CTTGGACTTT TAACTTTGAT 480

AAATAA 486



(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...276

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 



GTGTGTTTAG GGCTAGCTGA TGTGATGGTG GTTTTAAGCT TGCACCTCAA CCTAAACCCC 60 

ACCAACCCTA AATGGCTCAA TAGGGACAGG TTGGTTTTTA GCGGAGGGCA TGCGAGCGCG 120

TTAGTGTATA GTTTGTTGCA TTTGTGGGGC TTTGATTTGA GTTTAGACGA TTTAAAGCGT 180

TTCAGGCAAT TACACTCTAA AACCCCAGGA CACCCTGAAT TACACCACAC CGAAGGCATT 240

GAAATCACCA CASCCTTTAG GGCAAGGTTT TGCTAA 276



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...561

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11

ATGACAACAC CGATGATTAT TATTTCCCTA GAAATGGGGT TATCTTTAGT TCCTATGCGA 60

CAATGTCTGG TTTGCCAAGC TCTGGCACGC TCAATTTCTT GGAACGGGTT AGGCGGGAAT 120

GTCCGTAACA CCAAAGTTTA TGGTAAATTC GCCGCTTACC ACCATTTGCA AAAATATTTA 180

TTGATAGATT TGATCGCTCG TTTTAAAACG CAAGGGGGCT ATATCTTTAG GTATAACACC 240

GATGATTACT TGCCCTTAAA CTCCACTTTC TACATGGGGG GSGTAACCAC GGTGAGAGGC 300

TTTAGGAACG GCTCAATCAC ACCTAAAGAT GAGTTTGGCT TGTGGCTTGG AGGCGATGGG 360

ATT^TTACCR CTTCTACTGA ATTGAGCTAT GGGGTGTTAA AAGCGGCTAA AATGCGTTTA 420

GCGTGGTTT^ TTGACTTTGG TTTCTTAACC TTTAAMACCC CAACTAGGGG GAGTTTCTTC 480

TATAACGCTY CCACCACGAC GGCGAATTTT AAAGATTATG RCGTTGTAGG GRCTGRGTTT 540
GARRGGGCGA CTTGGr-wG^jGC ^ 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...315

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

ATGCAAGCGT TAAAATCATT GCTTGAAGTG ATTACAAAAC TCCAGAATCT AGGCGGCTAT 60

TTGATGCATA TAGCTATTTT CATCATTTTT ATTTGGATTG GAGGRCTTAA GTTTGTGCCT 120

TACGAAGCTG AAGGGATCGC CCCTTTTGTG RCCAACTCCC CTTTCTTTTC TTTCATGTAT 180

AAATTTGAAA AACCTGCATA CAAACAACAC AAAATGTCTG AATCCCAATC CATGCAAGAA 240

GAAATGCAAG ATAACCCTAA AATCGTTGAA AACAAAKAAT GGCATAAAGA AAACCGCACT 300 

TCATTTAGTG GCTGA 315 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...549

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13

ATGCAGTTTG AAGAAATGAA AGAATTAGCC CATCAAATTG GCGTGTTTTA CCATGTTGGC 60 

GTTGATGGCA TCGCGCTCTT TTTGTTGCTC TTAAACGCTA TCGTGGTGTT ATTGTCCGTG 120

GTTTATGTCA AAGAGCGTCG TAAAGACTTT GTGATTTGTC TTTTATTGTT AGAMGGGATC 180

TTAATGGGCG TGTTTTCTTC TCTTAATGTG ATCTTTTTCT ACGCTTTTTG GGAAATCTCG 240

CTCTTGCCGG TTTTATACCT CATCGGTCGT TTTGGCCGTA ATAACAAAAT CTATTCTGGC 300

ATGAAGTTTT TCCTCTACAC CTTTTTAGCG TCGTTGTGCA TGCTTTTAGG CATTTTATAC 360

ATCGGGTATG ACTACGCCAA TAATTACGGC ATGATGAGTT TTGATATTTT AGACTGGTAT 420

CAGTTGAATT TTTCTAGCGG GATTAAAACC TGGCTCTTTG TAGCTTTCTT AATAGGGATT 480

GCGGTTAAAA TCCCGCTCTT TCCCTTCACA CATGGCTGCC TTATGCGTAT TCTAACGCCC 540

CCACTCTAG 549

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...351

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

GTGAAAAAAT ACGCTGAAGA TTTTATCACC AAAGATGAAG TGAAATCCCT TTTAGAGCGC 60 

TTGGCTAAAG ACTATCCTAC GATTGTAGAA GAGAGTAAAA AAATCCCCAC CGGTGCGATC 120 

CGCTCAGTCT TGCAAGCCTT GTTGCATGAA AAAATCCCCA TTAAGGACAT GCTCACTATT 180 

TTGGAAACGA TTACTGATAT TGCTCCATTG GTTCAAAACG ATGTGAATAT CTTAACCGAA 240

CAAGTGAGGG CGAGGCTTTC YAGGGTGATC ACCAACGCTT TTAAATCTGA AGACGGGCGT 300

TTGAAATTTT TAACCTTTTC TACCGATRGC GAACAATTTT TKGCTCAATA A 351

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...720

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 

ATGATGAAAA ACAAACGCTC TCAAAATAGC CCTTATGTAA CGCCTGACAA CCCTTATCTA 60
ACGCTAGAAA AAGCTTTAGG GTATTCTTTT AAAGACAAGC GTTTATTGGA GCAAGCCTTA 120
ACGCATAAA? CATGTAAGCT CGCTTTAAAC AATGAGCGCT TGGAATTTTT GGGCGATGCG 180
GTGTTGGGCT TGGTGATAGG GGAGCTGCTA TACCATAAAT TCTRTCAWTR CGATGGGGGC 240
AAACTCTCTA AATTAAGGGC TTCTATTGTG AGCGCGCATG GTTTCACTAA ATTAGCGAAA 300
GCGATTGCTT TACAAGATTA TTTGCGCGTT TCTTCTTCTG AAGAAATTTC TAAGGGGAGG 360
GAAAAACCCT CTATTCTRTC AAGCGCTTTT GAGGCTTTAA TGGCTGGGGT GTATTTAGAA 420
GCAGGGTTAG CTAAGGTGCG TAAAATCATA CAAAATTTAC TCAATCGTGC TTACAAGCGT 480
TTGGATTTGG AGCATTTGTT TATGGATTAT AAAACCGCTT TGCAGGAATT GACCCAAKCK 540
CAGTTTTGCG TGATCCCCAC TTACCAATTA CTCCAAGAAA AAGGCCCCGA TCACCATAAA 600
GAATTTGAAA TGGCTCTATA CATTCAAGAT AAAATGTATG CGACCGCTAA AGGCAAGAGT 660
AAAAAAGAAG CCGAACAGCA ATGCGCTTAT CAAGCGCTTC AAAACTTAAG GAAGCCAAAT 720

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 687 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...687

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16

TTGCTGGTCT TACTCAATCT AAAGAWTACG CCGAATTTGA TGTGGCCTTT AGATATTATT 60
GTGGTTGTGG CATGGGTGTT ATGGGGGGTT AATATGTTTG GGAGCATGAG CGTTAGAAGA 120
GAGAATACTA TTTATGTGTC TTTGTGGTAT TACATCGCTA CTTATGTGGG TATAGCGGTG 180
ATGTATATCT TCAATAACCT TTCTATCCCC ACCTATTTTG TCGCTGATAT GGGGAGCGTT 240
TGGCATTMTA TTTCTATGTA TTCAGGCAGT AATGATGCGC TCATTCAATG GTGGTGGGGG 300
CATAATGCGG TCGCTTTTGT CTTTACGAGT GGGGTGATTG GCACAATTTA TTATTTCTTG 360
CCTAAAGAGA GCGGCCAGCC TATCTTTTCT TACAlAACTCA CTTTGTTTTC TTTTTGGAGT 420
TTGATGTTTG TTTATATTTG GGCGGGCGGG CACCATTTGA TCTATTCCAC CGTGSCTGAT 480
KGRGTACAAA CCCTTTCTAG CGYGTTTTCA GTGGTGTTGA TCTTGCCTTC GYGGGGGACA 540
GCGATTAACA TGCTTTTAMC GATGAGAGGC CAATGGCACC AGYTCAAAGA AAGCCCTTTG 600
ATCAAGTTTT TAGTTTTAGC CTCAACTTTC TACATGCTTT CCACGCTAGA AGGCTCCATT 660
CAAGCCATTA AAAGCGTGAA CGCTTAG 687

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 489 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...489

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17

ATGAAAGCAC CCTCCCAAYA GGATTTAAAA AAAATCTTAG GGATTGAAGA AGTCATAATS 60
STATCCACAA GCCCCATGGA ATTACGATTG GCCAATCAAA AGCTAGGCAA TCGTTTCATT 120
AAAACCTTAC AAGCCATGAA CGAATTAGAC ATGGGTGCAT TTTTTAACGC TTACGCTCAA 180
ACAACCAAAG ATCCCACCCA TGCCACTAGC TATGGCGTTT TTGCGGCGAG TTTGAATATG 240
GAATTGAAAA AGGCTTTAAG GCATTATCTT TATGCGCAAA CTTCTAACAT GGTGATCAAC 300
TGCGTTAAAA GCGTCCCCTT ATCCCAAAAC GACGGGCAAA AAATCTTATT GAGCTTGCAA 360
AGCCCTTTTA ACCAGCTCAT AGAAAAAACC CTAGAACTAG ACGAAAGCCA CTTGTGCGCA 420
GCAAGCGTTC AAAACGACAT TAAGGCGATG CAGCATGAGA GTTTATACTC GCGCCTTTAT 480
ATGTCTTGA 489



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...180

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

ATGGCGTTTA TTCTAACCAC AAACCTATTT ATCAAGAGTT TTACGAACTC AATTCGCATA 60 
ACGGGTTGTA TTATCAGCCC TAATGTGTTT TTTGCTTATG AATTTTGCGC GTTAGGGTTT 120 
AGAAAAGGGG GGTTAATTTT GGATAATTTT TCTAAATTCG TGAGCCACAG GTTGCAATAA 180 
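
Several sequences in this listing start with ATG, GTG or TTG and end with TAA, TAG or TGA, as an open reading frame would. The Python sketch below is illustrative only and is not part of the original listing: the function name `orf_summary` and the start and stop codon sets are assumptions made for the example. It reports whether a transcribed sequence has a length divisible by three, one of the assumed start codons, and a terminal stop codon, which may help flag transcription damage.

```python
# Assumed codon sets for the illustration (not stated in the listing itself).
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}

# SEQ ID NO: 18 as printed above, without the position counters.
SEQ_ID_18_ROWS = [
    "ATGGCGTTTA TTCTAACCAC AAACCTATTT ATCAAGAGTT TTACGAACTC AATTCGCATA",
    "ACGGGTTGTA TTATCAGCCC TAATGTGTTT TTTGCTTATG AATTTTGCGC GTTAGGGTTT",
    "AGAAAAGGGG GGTTAATTTT GGATAATTTT TCTAAATTCG TGAGCCACAG GTTGCAATAA",
]


def orf_summary(rows):
    """Join grouped rows and report simple reading-frame properties."""
    # Remove all whitespace between the 10-base groups and between rows.
    seq = "".join("".join(row.split()) for row in rows).upper()
    return {
        "length": len(seq),
        "multiple_of_three": len(seq) % 3 == 0,
        "starts_with_start_codon": seq[:3] in START_CODONS,
        "ends_with_stop_codon": seq[-3:] in STOP_CODONS,
    }


summary_18 = orf_summary(SEQ_ID_18_ROWS)
```

A sequence that fails one of these checks is not necessarily wrong; the check only points to rows worth comparing against the source document.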

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...747

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

GTGCGTTTTT TCATTTTTTT AATTCTCATT TGCCCTTTAA TATGCCCCTT AATGAGCGCG 60
GATAGCGCTT TACCTAGCGT CAATCTCTCT TTAAACGCTC CTAGTGATCC TAAACAACTC 120
GTAACCACCC TTAATGTCAT CGCCTTACTC ACGCTTTTGG TTTTAGCCCC ATCGTTGATT 180
TTAGTGATGA CGAGTTTCAC CCGTTTGATC GTGGTGTTTT CTTTTTTAAG GACCGCTTTA 240
GGCACGCAAC AAACCCCACC CACTCAAATT CTAGTCTCGC TCTCTTTGAT ATTGACTTTT 300
TTTATCATGG AACCTAGCTT GAAAAAGGCT TATGATACAG GGATTAAGCC TTATATGGAT 360
AAAAAGATTT CTTACACCGA AGCGTTTGAA AAAAGCACTC TGCCTTTCAA GGAATTCATG 420
CTTAAAAACA CACGAGAAAA AGATCTAGCC CTTTTTTTTA GGATTAGGAA TTTGCCTAAC 480
CCTAAAACCC CTGATGATGT GAGCTTGAGC GTTTTAATCC CGGCATTTAT GATAAGCGAG 540
TTGAAAACAG CGTTTCAAAT CGGCTTTTTA CTCTACTTGC CTTTTTTGGT GATTGATATG 600
GTTATCAGCT CTATTTTAAT GGCGATGGGT ATGATGATGC TCCCGCCTGT AATGATTTCT 660
CTGCCTTTTA AAATTTTGGT GTTTATTCTG GTGGATGGGT TTAATTTATT GACCGAAAAT 720
TTAGTGGCGA GTTTTAAAAT GGTTTAA 747

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 501 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori




(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...501

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20

TTGTTGGTTA CTTTTTTGAA TGGGTTTGAC CCAAAAATCG CTAATTTAAG GAAAGCGTGC 60
AATGTTTATA GYGTGGGGGT GATTTATATT GTAACCACCA ACACGCTCAA TATTTTAAGT 120
TGTGAGAGTT TTGAAATTTT AGAAAAAAGA GAGCTGGATA CAAGCGGCGT TACTAAAACT 180
TCCACGCCGT TTTTTTCTAG GGTTGAGGGC ATTGATGCAG GCACGCTAGG GAAACTTTTT 240
TCAGGCAGTC AATCTAAAAA TTACTTCGCT TACTATGACG CTTTAGTGAA AAAAGAAAAA 300
CGAAAAGAAG TAAGGATTGA AAAGAAAGAA GAAAGGATTG ATGCTAGAGA AAATAAACGA 360
GAAATCAAGC AAGAAGCCAT TAAAGAGCCT AAAAAAGCCA ATCAAGGCAC AGAAAACGCT 420
CCCACTTTAG AAGAGAAAAS CTACCAAAAR GCAGAGCGAA AATTTGACGC TAAAGRAGRA 480
AGGMGATCGT TCAAGRGATG A 501

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 381 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...381

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21

ATGGAAAACA GCACACTTTA TATTGTTATT GCCGGCTTAT GGCTTGCTGT AGGCTTTGGA 60
ATCTTTTTAA AGAAATTAGA CATGCCCGTT ATCATTGGCT ACATTTGCAC AGGAACGGTC 120
TTAGCGGCTT TTTTTAAAAT TAATGATTTT AATTTGTTGT CTGATATTGG TGAATTTGGT 180
ATCGTCTTTT TAATGTTTAT GATAGGCATT GAGTTTAATT TTGACAAGCT CAAGTCCATC 240
AAACAAGAAG TGCTCGTTTT TGGGCTTTTA CAGGTTGTTT TATGCGCTTT AATCGCTTTT 300
TTATTGGGGT ATTTTGTTCT GGGTCTTTCG CCCATTTTTT CCCTTGTTTT AGGCATGGGG 360
CTTTCACTCT CTTCAACCGC C 381

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...51

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22
TTGTTACTCA TGCTTAATAA GCCAAAGCCT TTATTTTTGM CTCTTGGTTA A 51 
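
Letters other than A, C, G and T in these sequences (for example M, R, Y, K, S and W) are standard IUPAC ambiguity codes, each standing for a set of possible bases. As a minimal sketch, and not part of the original listing, the Python below locates such positions in a transcribed sequence; the function name `ambiguous_positions` is hypothetical, and the table is the standard IUPAC nucleotide code table.

```python
# Standard IUPAC nucleotide ambiguity codes and the bases each may represent.
IUPAC_AMBIGUITY = {
    "R": "AG", "Y": "CT", "K": "GT", "M": "AC", "S": "CG", "W": "AT",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

# SEQ ID NO: 22 as printed above, without the position counter.
SEQ_ID_22 = "TTGTTACTCA TGCTTAATAA GCCAAAGCCT TTATTTTTGM CTCTTGGTTA A"


def ambiguous_positions(grouped_sequence):
    """Return (1-based position, code, possible bases) for each ambiguity code."""
    seq = "".join(grouped_sequence.split()).upper()
    return [
        (index + 1, base, IUPAC_AMBIGUITY[base])
        for index, base in enumerate(seq)
        if base in IUPAC_AMBIGUITY
    ]


hits_22 = ambiguous_positions(SEQ_ID_22)  # [(40, 'M', 'AC')]
```

For SEQ ID NO: 22 the only ambiguous position is 40, where M indicates A or C.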
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1053 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1053

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23

ATGGCATTAA GGGTATTATT ATTCTTTTGT TTTTTGTTTT TGCAAGCAGA AGATAAGAGC 60 

CAAGAATTAT CATCTATACA AAAACAAATG GCTTTGGTGG ATAAAAAACT CGCCAAAGAC 120 

GATAACGTGT GGTTGAAAAA ATTTGAAAAC TATAAAATTT ACAACCAAAT TTATACTGAA 180 

AAAGAGAGCG TGAGGCAGGA ATTAAGGCGC TTAAAAAACA AAAAAAGCAA GGATTTATTA 240 

AAGATTAGCA CCTTAGAGCA TACCTTAAAG GCTTTAGAGT CCCAGCAAAA AATGTTTGAA 300

AGCTATGGGG TCAATCCTTT TAAGGACTTG ATAGAGCGCC CCAATATCCC CAATATCCCT 360

AATATCGCTA ACCCTATTGC GATCATTGAT GGCATTTCTT TCATCAAGAG CATGCGTTTA 420

AAGCATGAAA ATCTTAAAAA TAACCAGACT TCTTTAGGAG AAGTTTTAAA GCTTTTAGAT 480

CAAAAACACC AGCTTTTAAA TCAGTGGCAC GCTTTGGATA AAAGCGCGAA ATTAAGCGAT 540

GAGATTTATC AAACTCAAGC CAAACGCTTA GAATTGCAAG GGGCTCAAAA CATTCTAAAA 600

ACCACRATCG GGATTTTCCA AAAAGACAGC GATGAAGCTA TAAGCATTGT CAAATCTCAA 660

GTTAAAAACC AGCTTTTTAA ATTGGTTTAT GTGTTTTTAG CGGCCCTTTT GAGCGTGGTG 720

TTTGCGTGGA TTTTAAAAAT CATTTCCAGT AAATACATTG AAAATAATGA GCGCGTCTAT 780

ACCGTGAATA AAGCCATTAA CTTCGTGAAT GTGAGCGTGA TCGKTKKAAT CTTKCTTTTT 840

TCTTATTTAG AAAACGTTAC TTACTTGGTA ACGGTTTTAG GCTTTGCGAG CGCGGGCTTA 900

GCGATTKCGA TGAAGGATTT ATTCATGAGC TTGCTCGGGT GGTTTATCAT TTTGATTGGG 960

GGGAGCGTGC ATGTGGGCGA TAGGGTGCGT ATCGCTAAGG GGACGGATAT TTTTATTGGC 1020

GATGTGTTGG ATACTTCTAA TGTTGTACAT TAA 1053 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...300

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

ATGAAAGAAC AAGAATGGGA TTTAAGCGCT TTATTTGAAA ATAAAGAAAG CGCAGAAGAA 60

T tt-t:^h CCTTACAAAC AGAAGTGCAA GAATTTGAGA ACGCTTATCA AAATAACCTT 120

AAGAA^T^G ACGCTGCAAA ATTTGCCAAC ACTCTTAAAC ATTACGAAAA TTTGTCAGAA 180

AAG^C^CTA GAGCGATGGC TTACGCCAAT TACTTTTTGC CAAGAACACT AAAGAAGCGA 240

AG^TVTTC GCAATGCAAA TGGCTTGTGC AAATATCCAA CAACACCTTT TATTCTTTGA 300



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 237 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...237

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25

TTGCGCGTGG GCATGTATGA AGTGTGTAAC CATAAAGACG GCACCGCTTA TCATTCCACA 60

AGAGG-TCTA AGGTTACCTT AGCGTGTAAA ACCGGCACCG CGCAAGTCGT AGAAATCGCT 120

C AAAAC L T CG TCAATCGCAT GAAAGAAAAG GATATGGAAT ATTTCCATCS MTCCCATRCG 180

TGGATTACGR CATATCTTTR CCCTATGAAA AACCCAAATA CGCTATCACT ATTTTAG 237



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 . . .159 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26

TTGGGTTTGG TGWRCGGRAT TTCTCTCTTG CATTTGAGTT TGGAGCAAAA AATCAGCGTG 60

TTTCTTGGRC YCAATTTAAT GCTTTAYCCG GTCAYAGAGG TGCTTTTTAG TATCCTTAGG 120

CGCAAAATAA AACGCCAGAA AGCCACCCAT GCCGGATAA 159 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1134 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...1134

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

TTGGCTCAAC CGGTTCAGGT AAGAACAGTG TTTATGTCCA TGACTCTAAA CGCTATGGGG 60

CAATTTGCCT ATAATTTTCC TGCTAATATC AGCAAAGACA AGCAAAAGCT CACTATGGTT 120

TATATGGATA AAGATTATGG CGCTTATGGG AATATTGTTG CAATGGGTGG GGAGTATGTC 180

AAGATTGAGC TAGGGACAGA TACAGGATTA AATCCTTTTG CTTGGGCAGC TTGTGTGCAA 240

AAAACAAATG CAACAATGGA GCAAAAACAA ACAGCTATTT CTGTTGTCAA AGAGCTTGTG 300

AAAAACTTAG CAACTAAAAG CGATGAAAAA GATGAAAATG GCAACAGCAT CTCTTTTAGC 360

CTAGCAGATT CTAATACGCT TGCAGCGGCA GTAACCAACC TTATCACAGG AGATATGAAC 420

CTAGATTATC CTATCACTCA ACTTATTAAT GCTTTCGGGA AAGACCACAA TGATCCTAAT 480

GGGCTTGTCG CGCGATTAGC GCCTTTTTGC AAATCAACCA ATGGTGAATT TCAATGGCTT 540

TTTGACAATA AAGCAACAGA TCGCTTAGAT TTTTCAAAAA CGATTATTGG CGTTGATGGG 600

TCAAGTTTCT TAGACAATAA TGACGTTTCG CCTTTTATTT GTTTTTACCT TTTCGCTCGT 660

ATCCAAGAAG CAATGGATGG GCGTAGATTT GTCTTAGATA TTGATGAAGC GTGGAAATAT 720

TTAGGCGATC CAAAGGTCGC TTATTTTGTG AGAGACATGC TAAAAACTGC AAGGAAAAGA 780

AACGCTATTG TTAGACTTGC GACTCAAAGC ATCACTGATC TTTTGGCTTG CCCTATTGCT 840

GATACGATTA GAGAACAATG CCCTACAAAG ATTTTTTTGA GAAACGATGG GGGTAATCTT 900

TCTGATTACC AAAGATTAGC CAATGTTACA GAAAAAGAAT TTGAAATCAT CACTAAGGGG 960

CTGGATAGGA AAATCCTCTA CAAACAGGAT GGAAGCCCTA GCGTTATCGC TAGTTTTAAT 1020

TTGAGAGGCA TTCCTAAAGA ATATTTGAAA ATTTTATCCA CAGATACTGT ATTTGTCAAA 1080

GAAATTGACA AGATTATCCA AAACCATAGT ATCATAGATA AATATCAGCC TTGA 1134

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 465 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

ATGAAA^GG TGAGTC^AT TGTAGCGTTA GTTTTTTGTT GTTTTTTAGG GGCTGTAGAG 60 

TTGCCTGGAG TTTATCAAAC TCAAGAATTT TTATACATGA AAAGCTCTTT TGTGGAGTTT 120

TTTGAGCATA ACGGGAAGTT CTATGCCTAT GGTATTTCTG ATGTGRATGR CTCTAAAGCC 180

AAAAAAGAC^ AACTCAATCC TAACCCAAAG CTAAGGAATC GCAGCGATAA AGGCGTGGTG 240

r r r p r r r r T AAGCG ATTTGATTAA GGTTGGGGAA CAATCTTATA AAGGCGGTAA GGCGTRTAAT 300

TTTTRTGACG GCAAGACCTM CCATGTGAGA GTCACTCAAA RTTCAAACGG GGATTTGRAA 360

"TCACTTCAA GCTATGRCAA ATGGGGGTAT GTGGGCAAAA CCTTCACCTG GAAACGCCTG 420

AGCGATGAAG AAATCAAAAA TCTAAAGCTC AAGCGTTTTA ACTGA 465

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 ... 24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
TTGGAGACCC TATTCTTGGT ATAG 24 
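
Each record states a LENGTH, and each sequence row ends with a running position counter, so a transcription can be checked against its own header. The Python sketch below is an illustration under stated assumptions, not part of the original listing: the function names `parse_sequence_rows` and `check_sequence` are hypothetical, and any trailing all-digit tokens on a row are assumed to be position counters (including counters split by scanning, such as "2 04").

```python
# Standard IUPAC nucleotide symbols (unambiguous bases plus ambiguity codes).
IUPAC_DNA = set("ACGTRYKMSWBDHVN")


def parse_sequence_rows(rows):
    """Join listing rows into one sequence, dropping trailing position counters."""
    groups = []
    for row in rows:
        tokens = row.split()
        # Assumption: trailing all-digit tokens are the position counter.
        while tokens and tokens[-1].isdigit():
            tokens.pop()
        groups.extend(tokens)
    return "".join(groups).upper()


def check_sequence(rows, declared_length):
    """Compare the transcribed length with LENGTH and list non-IUPAC symbols."""
    seq = parse_sequence_rows(rows)
    return {
        "length": len(seq),
        "length_ok": len(seq) == declared_length,
        "invalid_symbols": sorted(set(seq) - IUPAC_DNA),
    }


# SEQ ID NO: 29 as printed above (declared LENGTH: 24 base pairs).
report_29 = check_sequence(["TTGGAGACCC TATTCTTGGT ATAG 24"], 24)
```

Rows that contain scanning damage, such as stray `^` characters, show up in `invalid_symbols` and usually also cause a length mismatch.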
(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...345

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30

ATGSACACAC ACGACAGGCG CAAGTTAAGA ATTARCCTTA CACAAACGAC GACTTTAGTG 60
GCCACTATTG GCTCAAACGC CCCTTATATC GGTCTTTTAG GGACGGTTAT GGGGATCATG 120
CTCACCTTTA TGGATTTAGG CTCAGCTTCT GGCATTGACA CTAAGGCGAT CATGACTAAT 180
TTAGCCCTTG CTTTAAAAGC GACCGGCATG GGGTTATTGG TAGCGATCCC TGCGATTGTG 240
ATTTATAACT TGTTAGTGAG AAAAAGCGAG ATTTTAGTTA CCAAATGGGA TATTTTCCAG 300
CATCCGGTTG ATACGCAATC CCATGAGGTT TATAGCAAAG CCTAA 345



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 204 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...204

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31

ATGCAGGATT TAGACAATAA CATGTCCTTA GACACCGCTC ACAACACGCT GAGTTCTAAC 60

GGGAAAAACA TCACCATTGC CGGGGTGGTA AAAGCCTTAC AAAAAATTGG CGTGAGCGCT 120

AAGGGGATGG TTTCAATCTT GCAAGCCCTA AAAAAAAGCG GCGCGATTAG CGCGAAATGG 180

AGATACTATG ATAAACAACA ATAA 204

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...267

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32

TTGCACCCTT TAGCGGATGT CTTTGTGGTG AATGACAAAC GGYCCGTTTT AGCGATGGTA 60

RCGATGTTGA TTGRCTCGTT AGCGAATATC TTTTTCAATT ACTTGTTTAT TTTTGKGTTG 120

GAAGTGGGGG TTCAAGGCAG MGCGATAGTC ACCGTGATAG GGCATGCGAT AGGGGGTTTA 180

GTCTTAATGC AGCATTTTTG GCGCAAAAAA GGGGAGTTGT ATTTTATCAA ACTGATTTTC 240

TTTATCTTCA GTCATTTCTT CAGCTAA 267

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...831

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33



ATGTTAAUGA AAAACATTT-r AGCTTACTAT [illegible] TTCTCTTAAT [illegible] 60
AGCTTGCCCC ATGCGATTTT AACCCCCTTG TTGCTTTCTA AAGGGCTTAG TTTGAGTGAA 120
ATCTTGCTCG TGCAAACCTT TTTTAGTTTT TGCGTGCTGG TGGCTGAATA CCCAAGCGGC 180
GTTTTAGCGG ATTTGATGAG CCGGAAGAAT TTATTCCTGG TTTCTAATGT GTTTTTAATC 240
GCTAGTTTTT CGTTGGTGCT GTTTTTTGAT AGTTTTATCC TCATGCTTTT AGCGTGGGGG 300
TTGTATGGTT TGTATAGCGC ATGCTCTAGC GGCACGATTG AAGCTTCACT CATCACAGAC 360
ATTAAGGAAA ACAAAAAAGA TTTATCCAAG TTTTTAGCCA AAAACAATCA AATTACTTAT 420
TTGGGCATGA TTATAGGGAG TTCTTTGGGA TCGTTTTTGT ATCTCAAAGT CCATGCGATG 480
CTGTATGTCG TGGGGATTTT TTTAATCATG CTCTGTGCGC TAACAATCAT CATTTATTTT 540
AAAGAAAAAG AAGGGGATTT TAAAAGCCAA AAAAATTTGA AACTCCTTAA AGAGCAAGTC 600
AAAGGCAGTC TTAAAGAGCT TAAAGATAAC CCCAAGCTTA AAATTTTGTT AGTGGGGCAT 660
TTGATTACGC CTGTCTTTTT TATGAGCCAT TTCCAAATGT GGCAAGCGTA TTTTTTAAAA 720
CAAGGCGTTA AAGAGCAATA CCTTTTTGTG TTCTATATCG CTTTTCAAGT GATTTCCATC 780
CCTCATTCAT TTTTTAAAAG CCAAAAATTA KCAGCCAAAA AATCGCCCTG A 831



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...282

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34

TTGTATCCGC CGGGATCTGT GGTTAAAATG GGCGTGGGGT TAAGCTTTTT AGAAAACCTT 60
CATATCACAG AAAACACCAC TATCCCCACA CCGCCTTTTA TTGAAGTGGG CAAGCGCAAA 120
TTCAGGGACT GGAAAAAAAC AGGGCATGGC AATTCTAATT TGTATAAAGC CATTAGGGAG 180
TCCGTGGATG TGTATTTTTA TAAGTTTGGG CTTGAAATCT CTATAGAAAA MCTCTCTAAA 240
MCCTTTAAGG RAAGTGGRCT TTGGGGAAAA AACGGRCGTT GA 282

(2) INFORMATION FOR SEQ ID NO: 35:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 183 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...183

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35

ATGGCACACC ATKAAGAACA ACACGGCGGG CACCACCACC AYCACCACCA CACACACCAC 60

CACCATTATC ATGGCGGCGA ACACCACCAT CACCACCACA GYTCTCATCA TGAAGAAGGT 120

'i'G'ri'UL'AUL'A CTAUCGATAG TCATCATCAA GAAGAAGGTT G TT G'iChCGG GYAYCACGAG 180

TAA 183

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...894

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36

TTGGTTAAAA TAAGGTTATT TGATTTTACT ATAAGGTTGT TTAAACCTGA ATTTCACATT 60 

TTTGATTTTT TAAAAGGGAT TAGAGTTCTT ATGATTGAAT GGATGCAAAA TCATAGAAAG 120

TATTTAGTGG TTACGATATG GATAAGCACG ATCGCTTTTA TTGCCGCCGG AATGATAGGT 180

TGGGGGCAAT ACAGCTTTTC TTTAGATAGC GATAGCGCTG CCAAAGTGGG ACAGATTAAG 240

ATTTCTCAAG AAGAATTAGC CCAAGAATAC CGCCGCCTTA AAGACGCCTA TGCTGAGTCT 300

ATCCCTGATT TTAAAGAACT CACCGAAGAT CAAATCAAAG CCATGCATTT AGAAAAAAGC 360

GCGCTAGATT CGCTCATCAA TCAAGCTTTA TTGAGGAATT TCGCTTTAGA TTTAGGGCTT 420

GGTGCTACCA AGCAAGAAGT GGCCAAAGAG ATCAGAAAAA CGAACGTTTT TCAAAAAGAT 480

GGCGTTTTTG ATGAAGAATT GTATAAAAAT ATCTTAAAAC AAAGCCATTA CCGCCCCAAG 540

CATTTTGAAG AAAGCGTTGA AAGGCTTTTA ATCCTTCAAA AAATCAGCGC TCTATTCCCC 600

AAAACCACCA CCCCTTTGGA GCAATCCAGT CTATCGCTTT GGGCAAAATT GCAAGACAAA 660

TTAGACATTC TTATCCTAAA TCCTAATGAT GTTAAAATCT CTCTCAATGA AGAAGAGATG 720

AAAAAATATT ATGAAAACCA TAGAAAGGAT TTTAAAAAGC CCACAAGCTT TAAAACACGC 780

TCTTTATATT TTGACGCTAG TTTAGAAAAA ACTGATTTGA AAGAGTTGGA GGAATACTAC 840

CATAAAAACA AGGTGTCTTA TTTGGACAAM AGMGGGGAAA TTACAGGATT TTAA 894

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...273

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

ATGGTTAAAC ACTATCTTTT CATGGCGGTT TCGCAGGTCT TTTTCTCCTT CTTTTTAGTG 60 

CTGT^TTTTA TCTCTTCCAT TGTGTTATTA ATCAGTATTG CAAGCGTAAC GCTCGTGATT 120

AAAGTGAGCT TTTTGGATCT GGTGCAACTC TTTTTGTATT CCTTGCCWGG AACCATTTTT 180

TTTATTTTGC CGATYACTTT TTTTGCGGCT TKGCGYTTGG GGSTTTCAAG GCTTAGCTAT 240
GACCATGAAT TGTTAGTGTT TTTTCTCCTT TAG 273

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...261

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38

ATGAGTAAAA GAGCGATCCG TTTCCCTAAC AAGCTTTTTT CATACCCTAA ACCCAAAATA 60 

AAAGCGACAA ACACAAGCCA CACCGTTTTA TTCGC AT AC C CGCTCAAACC CCACGAAATG 12 0 

GCCTTATTAG CGCTCGCTAC CTCACTGCTC GCTCCAATTT TTAACGCTAT ACACAGCACT 180 

AACGCGCTCA ACGCTATCAA ACCTGATGGC ACCGGCTCTA AAATTAACCC TATAATCATG 240 

CCCATGAAAA TACAAAAATA A 261 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...426

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39

GTGTATAGCC GTTTTTTTGC CAACCAGCAT GAATTTGACT TTGAAGCTCA AGGGGCGCTA 60

GGGAGCGATC AATCAAGCTT GAATTTCAAA AGTACTCTAT TACAAGATTT GAATCAAAGC 120

TATAATTACT TAGCCTATAG CGCCACAGCA AGAGCGAGTT ATGGTTATGA CTTCGCGTTT 180

TTTAGGAACG CTTTAGTGTT AAAACCAAGC GTGGGCGTGA GCTATAACCA TTTAGGTTCA 240

ACCAACTTTA AAAGCAATAG CCAATCACAA GTGGCTTTAA AAAATGGCGC GAGCAGTCAG 300

CATTTATTCA ACGCTAACGC AACGTGGAAG CGCGTTATTA TTATGGGGAC ACTTCATACT 360

TTTATTTGCA TGTGGGAGTT TTACAAGAGT TCGCTCACTT TGGATCGAAT GATGTGGCGT 420

CTTTAA 426 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 558 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...558

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40

ATGYATGAAA ATGGTAGGGG TGTACCTAAA GATTACAAGA AAGCGGTTGA ATATTTCCAA 60

AAAGCTGTTG ATAACGATAT ACCTAGAGGG TATAACAATT TGGGCGTGAT GTATAAAGAG 120

GGTAAGGGAG TTCCTAAAGA TGAAAAGAAA GCGGTGGAAT ATTTTAGAAT AGCTACAGAG 180

AAAGGTTATA CTAACGCTTA TATCAACTTA GGCATCATGT ATATGGAGGG CAGGGGAGTT 240

CCAAGTAACT ATGCGAAAGC GACAGAATGT TTTAGAAAAG CGATGCATAA GGGCAATGTG 300

RAAGCTTATA TTCTCCTAGG GGATATTTAT TATAGCGGAA TGATCAATTG GGTATTGAGC 360

CGGACAAAGA TAAGGCTGGT CCATTATAAA ATGGCGGCCG ATGTRAGTTC TTCYAGAGCY 420

TATRAAGGGT TGTCAGAGTC YTATCSGTAT GGGYTAGGCG TGGAAAAAGA KWAAAAAAAG 480

GCYGAAGAAT ACATGCAAAA AGCATGCGAT TTTGACATTG ATAAAAATTG TAAGAAAAAG 540

AACACTTCAA GCCGATAA 558 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...420

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41

TTGCTCAACA TGTGGGATGA AGCCAAGAAA GAAGGGATCA ACATCAATAC AGAAAAGCTC 60

TCTCAAGAAT TGGGGGTTGT GTGCGTGCCA ACAAGCGCGA GATYCAAAGA AGATCGCTTG 120

AACACAGAGC TTTTATTAGA CGAAATTGTC AGGCTTTATT CTCAAAACAC TACAAACAAC 180

GAAAACATCA AAGTCCCATC TCAAAGTTTT AAAGAGTCTT TAAAATACAG CCAGAGCGCC 240

CAAAGAATCG CTAAATCAGT GATCAGTGAA AACAAACAAA ATGCGAGTTT TGAACACACT 300

TATAAGATTG ATAAGATTTT TAATGCACCA GCGTTATGGG ATTTTCATTT TTTTDGGGTT 360

TATGTTTATC ATCTTTTCTT TGAGCTTTTT AATAGGAGGG GGAGTGCRAA AAGCCCTTGA 420

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 582 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...582

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42

ATGCAAGAAG CGTTGTTGCG TTTTCAAGAG GGCTTTAAGG AGTGGGGTTA TCTTATTTTA 60

TTTTTGTATT CTTTGGGGGG TGGGTATGTA GGGATTGTCA TCGCTTCCAT TTTGAGCGCT 120

ACCACGCACG CTTTGGATAT AAAAATAACC ATTCTTGTCG CTTTTTTAGG GAATTTAATA 180

GGGAGTGGGG CTCTTGTAAT CTTTGCCCGC TATCAAAAAA GAGAGTTTTT AAAGTATTTC 240

CAAAAGCATA GAAGAAAGCT TGCTTTGGCG AGTTTGTGGG TGAAACGCTA CGCCTTGCTC 300

ATGATTTTTG TCAATAAATA TCTCTATGGG ATTAAAAGCG TTGTGCCTTT GGCAATTGGT 360

TTTAGCAAAT ACCCTTTAAA AAAGTTTTTA TGGCTTAATG TTTTTTCCAG TTTTTTGTGG 420

GCGTTAATCG TGGGGAGCGT TTCTTTTCAA GCGAGCGATT GGGTGAAAAC GCTGTATGAA 480

AGGCTTTCTC ATTACACTTC GTTTTTTGTC ATAAGTTTTG TTCTTATAGC GCTTTTAATA 540

TGGTTTTTAT TGAAACGATA TTCGCGCAAA ATGGGKTTTT AA 582

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...390

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43

ATGCGAAAGG GGCGTGTGAT GTTATGCGTG TTTGATATAG AAACCATTCC TAATATAAGC 60

TTGTGTAAAG AGCATTTTCA ATTAAAAGAA GACGATGCGC TAAAAATCTG TGAATGGAGT 120

TTTGAAAAGC AAAAAGAAAA AAGCGGGAGC GAGTTTTTGC CCCTTTATTT GCATGAAATC 180

ATCTCTATTG CAGCMGTCAT TGGCGATGAT TACGGGCAAT TTATCAAAGT AGGGAATTTT 240

GGTCAAAAAC ACGAGAATAA AGAGGATTTT GCGAGCGAAA AAGAGCTTTT AGAGGACTTT 300

TTCAAATACT TTAACGAAAA GCAACCGCGC CTAATAAGCT TTAAWGGCAG GGGTTTTGGA 360

TATTCCCCTA CTCACGCTCA AAGCCCTTAA 390



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...924

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44

ATGGCTAAAA AGAAAATTGC GATCAGCTGT GGGGATATTC AAGGCGTAGG CTTAGAATTG 60

ATCTTAAAAA GCCATAAGGA AGTGAGTGCA CTTTGTGAGC CGTTGTATCT CGTTCATAGC 120

GAACTTCTAG AACGAGCCAA TCAATTGCTT GATAACGCTT ATGAAACTAA AACGCTTAAT 180

GCGATCGCTA TTGATGCCCC TTTACCCTTA TTAAACTCTA GCACGATAGG CAAAGTCAGC 240

ACTCAAAGCG GGGCGTATAG CTTTGAGAGT TTTAAAAAGG CTTGCGAGTT GGCGGATAGT 300

AAAGAAGTGG ATGGCATTTG CACTTTGCCT ATCAACAAAC TCGCATGGCA ACAAGCTCAA 360

ATCCCTTTTG TGGGGCATAC CGATTTTTTG AAACAACGCT ACAAAGATCA TCAAATTATT 420

ATGATGCTTG GGTGTTCAAA ACTCTTTGTG GGGCTATTTA GCGACCATGT GCCTTTAAGC 480

GCGGTTTCTC AACTCATTCA AGTGAAAGCG TTAGTTAAGT TTTTATTAGC GTTTCAAAAA 540

AGCACTCAAG CCAAAATCGT TCAAGTGTGT GGTTTCAACC CCCATGCGGG CGAAGAGGGA 600

TTGTTTGGGG AAGAAGATGA AAAGATTTTA AAAGCCATTC AAGAGAGCAA CCAAACGCTA 660

GGTTTTGAAT GCTTTTTGGG GCCACTGCCC GCTGATAGCG CTTTTGCCCC CAATAAACGC 720

AAAATAACCC CCTTTTATGT GAGCATGAGC CATGATGTAG GGCTAGCCCC TTTAAAAGCG 780

CTCTATTTTG ATGAAAGCAT CAATGTGAGT TTGAACGCTC CCATTTTACG CGCTTCCACT 840

GACCACGGCA CGGCGTTTGA TATTGCTTAT CAAAATAAGG CGAACCATAA AAGCTATTTG 900

AACGCGATCA AATACTTGGC TTAA 924

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...440

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45

ATGAGCAGCG GGTTAATTTA CATTTCRTTA GAAGTCTTGG TARCGTGTTT GATCACCGCT 60

CTAATCATGT ATTATGTGAT GAAAAAGATC TATTACGCTA GAGGGCAAGC CATTTTAAAA 120

GGCGCTTCAG CCAAAGCTAA ATTAATGGAA TTTCAAGCGA AATCTTTCGT GGAAGCTGAA 180

GAAATGCGCA TGAAAAGCCA AGAATGCAAG TTGCAACAGC AATATGAAAA TAAGAATTTG 240

CAACTCCAAA CCCATTTTGA TAAAAAAGAA GCGCATTTGA AGCATTTAGA AGCGCAGCAC 300

AAAGAATTTG TAAGAGATGA AAAACGCTAT TTGGAAAAGG AAAAAAAAGA GCTTGAAAAA 360

GAACGCCAAA TTTTAGAAMM AGAGAGGGAA AATTTTRRRR RSCAGCGCGC CTTTGTRRRG 420

RRGSCTCRRG CCAAAGCGCT 440 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 384 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...384

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46

ATGAATATCA AAATTTTAAA AATATTAGTT GGAGGGTTAT TTTTTTTGAG CTTGAACGCC 60

CATTTATGGG GGAAACAAGA CAATAGCTTT TTAGGGATTG GTGAAAGAGC CTATAAAAGC 120

GGGAATTATT CTAAAGCGGC GTCTTATTTT AAAAAAGCAT GCAACGATGG GGTGAGTGAA 180

GGCTGCACGC AATTAGGAAT CATTTATGAA AACGGGCAAG GCACTAGAAT AGATTATAAA 240

AAAGCCCTAG AATATTATAA AACCGCATGC CAGGCTGATG ATAGGGAAGG GTGTTTTGGC 300

TTAGGGGGGC TTTATGATGA GGGTTTAGGC ACGGCTCAAA ATTATCAAGA AGCCMTTGAC 360

GCTTACGCAA GGCATGCGTT TTAA 384 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...351

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47

ATGGCGATAG CCATTAAGGA TTTATTGAGC GCTTATAAAG TCGTTTTACC TTTGGATAAA 60

ATCAGCATGC CATCTAGCGC GGATTTGAAG CTCACTTTGC AATTCTTAAA AAACACCGCC 120

CCCTTATTTA GCGTTCAAGG CAGCGTTAAT TTGCAAGAAG GCACTTTCTC GCTCTATAAT 180

ATCCCCCTTT ACACGCAAAG CGCTCAAATC AATTTGGACA TCGCCCAAGA ATACCAATAC 240

ATCTACATAG ACACGATCCA CACGCGCTAT GCAAACATGC KGGATTTAGA CGCTAAAATC 300

GCTTTAGATT TAGGTCAAAA AAACCTYTCY YKGGAKKCYY TAGGKCCATA A 351 

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...249

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48

ATGCCGGATA AYTTGCATTT GCACACCCTT TTATYTAAAT TCTTGCAACA ACGCTCTTTC 60

AATTACCCTA ACCCTTTATG CGCGTTTATC CTTATTCTAT GCAACCTGCC TTTTATTTTA 120

ATAAGCGTTT TGTTTCGCTT GGACGCTTAT GCGCTCATTG TGATTAGCCT AGTCTTTATC 180

RCATGCTATT TAATAGGCTM TGCTTATTTG AATAGGCAAG TTTGCGCTTT AGAAAAGCGG 240

GCGTTTTAA 249

(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...351

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49

TTGAAAGTAA CRAATCCGCA TTTGTTGGTG GTAATCCAAG ATTTAAACGC TCGCATCGCT 60

TTAATGAAAC TCTTATTCCA AAACGTTAAG AGCGCGAACA AAGAATTGGT TTTTTGCAAT 120

AAAGAAAAAC GCTTGATAAG GTCTTTTGAT GCACAAAAAG AATACGGCAT CACGCCTGTA 180

GAAAATATTT TAAGCGTTTT AGACACCGCT ATGAATCCTA ACAGCGCGCT TGTGATAGAC 240

AATCTCAACG AAGCGAAAGA ATTGCACGAC AAAGTAGGGG CGGAAAAGTT AAAATCGTTT 300

TTAGAAAAAG CCCMTAGACA ACGAGCAGTA TTGCGTCATT TTTGCGCATG A 351

(2) INFORMATION FOR SEQ ID NO: 50:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 597 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...597

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50

ATGAAAGAAT CTTTTTACAT AGAGGGAATG ACTTGCACGG CGTGTTCTAG CGGGATTGAA 60

CGCTCTTTAG GACGTAAAAG TTTTGTGAAA AAAATAGAAG TGAGCCTTTT AAATAAGAGC 120

GCTAACATTG AATTTAACGA AAATGAAACC AATTTAGACG AAATTTTTAA ACTCATTGAA 180

AAACTGGGTT ATAGCCCTAA AAAAACTCTA GCAGAAGAAA AAAAAGAATT TTTTAGCCCT 240

AATGTTAAAT TAGCGTTGGC GGTTATTTTC ACGCTTTTTG TGGTGTATCT TTCTATGGGG 300

GCGATGCTTA GCCCTAGCCT TTTACCTGAA AGCTTGCTTA CGATTAACCA TCATAGTAAT 360

TTTTTAAACG CTTGCTTACA GCTTATAGGC GCGCTCATTG TCATGCATTT GGGGAGGGAT 420

TTTTACATTC AAGGGTTTAA AGCCTTATGG CACAGACAAC CCAACATGAG CAGCCTTATC 480

GCCATAGGCA CAAGTGCTGC CTTAATTTCA GCCTGTGGCA ATTGTATTTG GTTTATACCA 540

ATCATTATAC CGATCAGTGG TCTTATGGGC ATTATTATTT TGAAAGCGTG TGCGTGA 597

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...258

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51

GTGGGGATTG TGCCGGATAA TTTGTGGAAG CTCAAACGCT TCAATCAAGA CTGGCGCGTT 60

GGGGACACGC TCATTACTGC TATTGGGCAA GGCTCTTTTT TAGCCACGCC TTTGCAGGTG 120

TTAGCCTACA CAGGACTCAT TGCGACAGGC AAACTGGCAA CGCCTCATTT TGCTATCCAT 180

AACCAACAAC CGCTCAAAGA CCCCCTGAAT AGGTTTTCAA AAAAAGAAGC TCCAAGCCTT 240

GCGCGTGGGC ATGTATGA 258

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1032

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52

ATGCAAAACT TATTGATACA AGCAGAAAAT GCAATCGCTC TACTTTTTTT GTTAAATGAC 60

AAAAACCTAA AAGGAAAAAT AGATTTGATA TATATTGACC CTCCATTTGC TACAAACAAT 120

CATTTTACTA TCACAAATGG TAGAGCAACC ACAATTAGCA ATTCTAAGAA TGGCGATATT 180

GCTTATAGTG ATAAAGTAGT GGGTATGGAT TTTATGGAAT TTTTAAAACA ACGCCTGGTA 240

TTGCTTAAAG AATTGCTTTC AGAACAAGGC TCTATCTATG TGCATACAGA TTACAAGATA 300

GGGCATTATG TCAAGGTAAT GTTAGATGAA ATATTTGGCA TACAAAATTT TAGAAATGAA 360

ATCACACGCA TAAAGTGCAA TCCTAAAAAT TTTAAAAGGA TAGGCTATGG TAACATAAAA 420

GATATGATTT TATTTTACTC TAAAGGAAAA AATCCCATTT TTAACGAACC TAAGATCCCT 480

TATACGCCAC AAGATTTAGA AAAACGATTC CCTAAAATTG ACAAAGATAA AAGGCGTTAC 540

ACTACCGTTC CAATACATGC TCCAGGAGAA GTGGAAAGTG GCGAATGTTC TAAAGCATTT 600

AAAGGTATGC TACCTCCAAA AGGGCGGCAT TGGCGCACTG ATATTGCCAC ACTTGAGCGT 660

TGGGATAAAG AAGGTTTGAT TGAGTATTCT AACAATAATA ACCCTAGAAA AAAAATTTAT 720

CCAAGCTACC CTACAGAAAA AAACGCTCAA TTATTAGACT TAATCATTAA AACCTCTTCT 840

AATAAAGATA GTATTGTTTT AGATTGTTTT TGTGGTTCTG GAACAACCTT AAAATCTGCG 900

TTTTTATTGC AACGAAAATT TATAGGCATT GATAATTCCG ATTTGGCTAT CCAAGCTTGC 960

AAAAACAAGC TTGAAACAAT AACAAAAGAC TTGTTTGTTT CTCAAAATTT TTATGATTTC 1020

CTTGTTTTTT AA 1032

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...531

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53

ATGACAAGCG TTGTCATCAA GCCCCATGCC TATGGCGAGC AAGTCCAAGA AATAGAAGAA 60

GAGTCAGATA GCGATTATGA AAAGAATAAC GACCAGGAAG CGATCAATTT TGGTATCGCC 120

TTGCATAAGG GATTGGAATA CCAATACGCT TACAACATTC CTAAACAAAG CGTTTTAGAA 180

TATTTAAACT ACCACTATGG TTTTTATGGT TTGGATTACC AAGCGTTAGA AGAAAGTTTA 240

GAGCTTTTTG AAAACGATGC AGGGATACAA GCCCTTTTTA AAAATCATGC CTTAAAGGGT 300

GAAGCGGCTT TTTTATTCCA AGGGGTTGTG TCTAGGATTG ATGTTTTATT GTGGGATAGA 360

GGGCAAAATT TGTATGTTTT AGATTATAAA AGCTCTCAAA ATTACCAGCA AAGCCATAAA 420

GCGCAAGTGT CTCATTACGC TGAGTTTTTG CGAACTCAAG SCCCCCATTT TAAGATACAA 480

GCGGGCATTA TTTACGCTCA TAAAAGACTG CTTGAAAAAT YATGGGKTTG A 531

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 783 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...783

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54

ATGTCTGAGG ATTTGCCTTT TGCGAGCGAT TCGCAATTCA CTTACAATGG GGTGAGCATC 60

ACGCGCCCCA CTAATGAGGT CAATGATGTG ATCAGCGGGG TTAATATCAC TTTAGAGCAA 120

ACCACAGAGC CTAATAAGCC TGCCATTATC AGCGTGAGCA GAGACAATCA AGCCATTATA 180

GACAGCCTTA AAGAATTTGT CAAAGCCTAT AATGAGCTTA TCCCTAAACT AGACGAAGAC 240

ACGCGCTATG ACGCTGACAC TAAAATCGCC GGGATTTTTA ACGGCGTGGG CGATATTCGT 300

GCCATTAGAT CCTCTCTTAA TAATGTGTTT TCTTATAGCG TGCATACGGA TAATGGGGTA 360

GAAAGCTTGA TGAAATACGG GCTTAGTTTA GACGATAAGG GCGTGATGAG TTTGGATGAA 420

GCTAAATTAT CAAGTGCATT AAATTCTAAC CCTAAAGCGA CTCAAGATTT TTTCTATGGG 480

AGCGATAGCA AGGATATGGG GGGCAGAGAA ATCCACCAAG AGGGCATTTT TTCTAAATTC 540

AATCAAGTCA TCGCTAACCT CATAGATGGA GGGAACGCTA AATTAAAGAT TTATGAAGAT 600

TCCCTAGACA GAGACGCTAA AAGCCTGACC AAAGACAAAG AAAACGCTCA AGAGCTTTTA 660

AAAACCCGCT ATAACATCAT GGCGGAGCGC TTTGCCGCTT ATGATAGTCA AATCTCTAAA 720

GCCAATCAAA AATTCAATTC CGTGCAAATG ATGATCGATC AAGCAGCGGC TAAAAAGAAT 780

TAA 783 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...438

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55

ATGCGCATCG TATTTATGGG AACGCCTAGT TTTGCTGAAG TGATCTTAAG GGCGTTGGTT 60

GAAAATGAAG ATAAAAAGAT AGAAGTGGTG GGGCTATTCA CTCAAAGGGA CAAACCTTTT 120

GGGCGCAAAA AAGAATTGAA AGCCCCAGAG ACTAAAACAT ACATTTTAGA AAATCATTTA 180

AATATTCCCA TTTTCCAGCC GCAAAGTTTG AAAGAGCCTG AAGTTCAAAT CTTAAAAGGT 240

TTGAAGCCTG ATTTTATCGT GGTGGTGGCT TATGGTAAGA TTTTGCCTAA AGAGGTTTTA 300

ACCATCGCTC CTTGCATTAA TTTGCATGCG TCGTTATTGC CCAAATACAG GGGGGCTTCG 360

CCCATTCATG AGATGATACT CAATGACGAT AGGATTTATG GCATAAGCAC CATGCTTATG 420

GAKTTTGGAA TTGGATAG 438 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 747 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...747

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56

ATGCGTTTTT ATTTTAAATT CCTTTGGCTT TTAGGGATTT TTCTTATTTT TTATTTTTTA 60

GACATTAAAG GCAGCTCTTC TTATATCAGC GACCGGGTTA AAAGCGCCTT GATGAGCGCG 120

AAAAACAGCT TACTAGACAA CGTTCAAGCG TATTTTTTTC AAGCCCAAAA CATTAAGGAA 180

TTTCAAAAAG AACGCTTGAT TTTAGAAGCT TTAAAACTAG AAAACGCTGA TTTGAAAGAG 240

CGTTTGAATA GTATTTATCC TTTAGAAAAT CCAAAAATGA CTTATACCCC TACTTTCATG 300

ACTTCATTCA IVAAl'TTAGA AGACACACAC AGCGTTTCTC TCAACCCTAT 7GTGAATT7A 360

GAAGAAAACA AGATTTATGG CCTTGTCTCT CACAACCAAG CCATAGGCAT TGCCGTGCTA 420

GAAAAAGGGC GCTTGAACGG GTTTTTGAAC GCCCACAAGC GGTGTGCTTA TAGCGTGATG 480

ATAGGCCAAA ATCAAGTCTT AGGCTTTATA GGGACTAATT TCAAGCAAGA ATTAGTCGTG 540

GATTTCATTG TCCCAAGCGC TGAAATCAAC ATAGGCGATC AAGTGCTAAC GAGCGGGCTA 600

GATGGGATTT TTGGAGCGGG GGTGTTTGTG GGTGAAGTTT CAAGCGTTGA AGATCATTAC 660

ACTTATAAAA GCGCGGTGTT GAAAAACGCT TTTTTAAGCG AAGCCAAACT TTTAAGGCAT 720

GTGTTTTTAA GCGGTGTGAA AAACTAG 747

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...360

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57

TTGGCTTTAA GATTGCCTTT TTTGATCGCT CACGTCATCA ACATGTTTTT ATTCTACCTC 60

ATAGGGCGAA AGATTTTAAA AAAGCCTAAA GACGCTCTTT ATGTGGTATT GACTTACGCT 120

TTATTGCCTG GGGTGAATCT CTTTGCGATT TTACTGGCTA AAAGCGTGCT GGTGTTAAGC 180

CTTGGGCTTT TGATTAGCTA TTTGTATATT AAAACCCAAA AAATCCCTTA TTTAACCCTT 240

AGCGCTTGCG CGTTTTTAGA CGGTGCGTTC ATCCCGCTTT TACTAGGGGT TTTTGCCTAC 300

GCTTTAAGGA AAACGGCTAT TTTAAGAGCG CGATCTTTGC TTTGGTGGTT TTAMTTGTGA 360

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...327

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58

GTGAATTTAA TGGACTATTT TTCTAAAAGT TTGTTTTTAA ATTCATTGAA CACGCAGCGA 60

TTGATCGTCT CCAACAAATT AGCGATTGAC GTGCAATACG GCATGCTCCA AAGTGTCCGC 120

AAAAATTACC CTGATGTGGT GGATGGGGGT GTTAGGGAGG GGCCTTTTTG GGTGTTAGCC 180

GGGGCY-TAA TGCCTTCAAT TTTAATAGAA ATTGGTTATA ATTCCCATGC GATAGAATCT 240

AAACGCATCC AAAGCAAACC GTATCAAAAA ATCTTGGCTA AGGGCATTGC TGATGGCATT 300

GATAGTTTCT TCAGCAAGAA TGATTAG 327

(2) INFORMATION FOR SEQ ID NO: 59:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...474

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59

TTGGCGTCTC GCTATTCTGT GGCTGTTGGG AATTTATTTT CAGAGCATTT GTATGATTTA 60

AGAAATGAAA CCATGACCAA TCTCATTGGT TTTTTACTGG TGTTGGCGTC CATTTGGGTG 120

TrT>TTTtT , TT , AG C ^ CTTGGAGT GTTGCTAGGC AAGATGTTAG TCTTTAGCGG TCTAGGCATT 180

ATAGACAAGG CGTTAGGGTT TATTTTTTCA TGTTTGAAGA CTTTTTTAGT GCTTTCTTTC 240

ATCCTTTATG CGCTC^CTAA AATGGATTTA ATGAAAGACG CTAACGCCTA TTTGCAAGAA 300

AAGAMCRCT" TTTTCCCCAC CATRAAAARC RTCRCTAGTA AGATCATGCG CCTTGATGGC 360

G^CAAACATG TGGAGAAAAA CCTTAAAGAC AACCTTGAAG AAATGAGCGA TGAAGTTAAA 420

AATAAAGGAT CTATTGATAA CGCCAAAGAA TCTTTTAATA AGGGCTACGG ATAA 474

(2) INFORMATION FOR SEQ ID NO: 60:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 246 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...246

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60

TTGAGCAAGC AAAGCGCGGA CATTGTGATC ACTAATGACT CCTTAAGCTC TTTAGTCAAA 60

GTTTTAGCGA TCGCTAAAAA AACTAAAAGC ATTACTTGGC AAAATATCTT GTTCGCTTTG 120

GGGATTAAGG CGGTTTTTAT CGTGCTAGGG CTTATGGGGG TAGCGAGCTT GTGGGAAGCG 180 

GTCTTTGGCG ATGTGGGGGT TACGCTTTTA GCYTTAGCCA ATTCCATKCG CACGATGAGG 240 

GCTTAA 246 

(2) INFORMATION FOR SEQ ID NO: 61:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 240 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...240

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61

ATGAAAAATT TAAGGCATTT TAGAAAGCTT ATCGCCTTTT TAGGTTTTTC ACCTCTTTTA 60

TTACAAGCGG ATATGACTAC CTTTTTTAAT TCCATTGAAC AACAGCTCAC TAGCCCTACG 120

GCTAAAGGCA TTTTAATGGT TATTTTTTTA GGACTTGCTA TTTTTATATG GAAAAACTTA 180

GATAGATGGA AAGAAATTTT AATGACCGTG CTTGCTTTAA AAGRAGTCCC CATGCAATMW 240

(2) INFORMATION FOR SEQ ID NO: 62:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...978

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62

TTGGCGGGTT TGCYAGTGGG GTGTATRCGG ATGAAACAAA CATTTTGGGR ACTTAGTTGG 60

GGGGAAAAAA GCCAAAAGGT ATGCGTGCAT CGTCCATGGT ATGCTATATG GAGTTGCGAT 120

AAATGGGAGG AAAAAACACA ACAATTTACA GGAAACCAAC TCATCACAAA AACTTGGGCA 180

GGGGGTAATG CGGCTAACTA CTACCACTCT CAAAACAACC AAGACATCAC AGCCAATTTA 240

AAAAATGATA ACGGCACTTA TTTTTTAAGC GGTCTGTATA ACTACACCGG AGGGGAATAT 300

AATGGGGGGA ATTTAGACAT TGAATTAGGC AGTAACGCTA CTTTTAATCT AGGTGCGAGT 360

AGTGGGAATA GCTTCACTTC TTGGTATCCT AATGGGCATA CTGATGTTAC TTTTAGCGCT 420

GGGACTATCA ATGTGAATAA CAGCGTAGAA GTGGGCAATC GTGTGGGATC GGGAGCTGGC 480

ACGCACACCG GCACAGCCAC TTTAAACTTG AACGCTAATA AGGTTACTAT CAATTCCAAT 540

ATCAGCGCGT ATAAAACTTC GCAAGTGAAT GTAGGCAATG CTAACAGCGT TATTACCATT 600

AATTCGGTTT CTTTAAATGG GGAAACTTGC AGKTCTTTAG CTAGGGTGGG CGTAGGGGCT 660

AATTGCTCCA CTTCTGGGCC TAGCTATTCT TTTAAAGGGA CSACTAACGC TACTAACACS 720

ACTTTTAGCA AWTCAAGCGG SAGTTTCACT TTTGAAGARA ACGCCACTTT TAGCGGGGCG 780

AAATTAAATG GGGGGGCATT CACTTTCAAT AAAAAGTTTA ACGCTACCAA TAATACCGCT 840

TTTAATAGCG GTAGTTTTAC TTTTAAAGGC ACAAGCTCTT TTAATGGTGC GAATTTTAGT 900

AACGCTTCCT ATACTTTTAA TAATCAAGCC ACTTTCCAAA ACAGCTCCTT TAATGGGGGG 960

ACTTTTACTT TTAATTGA 978

(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 816 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...816

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63

TTGTTAAGTT TAGTTAAAGG GAAAACCATG CTCCGCTCTC TCTATAGTGC CACTTCAGGG 60

ATGCTCGCCC AACAAACGCA CATTGACACC ACTTCAAACA ACATCGCCAA TGTCAATACC 120

ACCGGGTTTA AAAAATCTCG CGCGGATTTT AACGACTTGT TTTACCAAGC GATGCAATAC 180

GCCGGCACCA ACACAAGCAA CACGACTTTA TCGCCAGATG GCATGGAAGT GGGCCTTGGC 240

GTACGCCCTA GTGCGATTAC CAAAATGTTT TCGCAAGGCA GCCCTAAAGA AACGGAGAAT 300

AATTTAGATA TTGCTATTAC AGGTAAAGGC TTTTTTCAAG TCCAGCTTCC TGATGGCACT 360

ACCGCTTACA CAAGGAGCGG GAATTTCAAG CTAGACGAGC AGGGCAATCT TGTAACAAGC 420

GAGGGCTATC TCCTCATCCC TCAAATCACT TTACCCGAAG ACACCACGCA AGTGAATATC 480

GGTGTGGATG GCACGGTGAG CGTGACTCAA GGCTTGCAAA CGACTTCTAA CGTGATCGGG 540

CAAATCACTT TGGCTAATTT TGTCAATCCG GCGGGGCTTC ATTCTATGGG GGATAATTTG 600

TTTTCCATCA CCAACGCTAG CGGCGATGCG ATTGTGGGCA ACCCGGATTC TCAAGGCTTA 660

GGCAAGTTAA GGCAAGGCTT TTTGGAGCTT AGTAACGTGA GATTGGTAGA AGAAATGACA 720

GATCTAATCA CCGCTCAAAG GGCTTATGAA GCCAATTCTA AAAGCATTCA AACCGCTGAT 780

GCCATGCTCC AAACAGTCAA TTCCCTCAAA CGCTAA 816

(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 273 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...273

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64

ATGCAAAATG GGTATTATGC GGCCACAGGG GCAATGGCTA CACAATTTAA CCGCTTGGAT 60

TTAACCTCTA ACAATTTAGC CAACCTAAAC ACCAACGGCT TTAAAAGAGA CGATGCGATT 120

ACAGGCGATT TTTTAAGGCT TTACCAAGAA TACCGAGAGC AACTGCCCTT AGAAGATCAA 180

ACCAAAGCGA GCGCGAAGTA TCTCAACCGC AMCCTCAATC GTGTGCCTAT TCTATCARAA 240

ATCTATACKK ATAGGRAGCT TGGCYGCGTT TGA 273

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...585

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65

GTGGGGGCTA TGCCTACTAT CCAAATCCGT GRCTTTGGAG CGGGGGGTTC AGGGCATAGC 60

GATGCGACGC TCATGTTAGT TAATGGTATT CCTGTTTATA TGGCCCCTTA CGCTCACATT 120

GAGCTAGACA TTTTCCCTGT TACCTTTCAA GCCATTGATC GCATTGATGT GATCAAAGGT 180

GGAGGCAGCG TGCAATATGG GCCTAACACT TATGGGGGTA TTGTCAATAT CATCACTAAA 240

CCTATCCCTA ATCAATGGGA AAACCAAGCG GCTGAAAGGA YCACTTATTG GGCTAAGGCT 300

AGAAACGCTG GGTTTGCCGC TCCCCYTGAT AAAACCGGCG ATCCTTCTTT CATCAAGTCT 360

TTAGGCAACA ACCTCCTCTA TAACACTTAT GTGAGGAGCG GAGGGATGAT CAATAAGCAT 420

GTGGGTATCC AGCGCAAGCT AACTGGGTTA GAGGCCAAGG CTTTAGGGAC AATAGCCCCT 480

CTAGTATTTC AAACTATTGG CTGGATGGGG TCTATGACAT CAATGAAAGC AATGGGATTA 540

AAGCCTATTA CCAATACTAC GATTTTGGCT ATCGSCCAAC CGGGA 585

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...255

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66

ATGAGAWAGG AGAAAATAAT GACGAATTTT GAAAAGRTTA TCGCGCAAAA CAGGCTCAAA 60

ACGAACGCGG TTTTAACCAC TTACTGCGCG ATTTTTGCTT TTATTGGGTT GTTGGTGGAT 120

GCTATTAGAA TCAACGCTAA TGATTTAGGT ATAGCCCTTT TTAAACTCAT GACTTTTCAA 180

ATTTTTCCTA CGRTTACTAT TGTCATGTTT GTGGTGGCTT TTGTCATTAK KCKKAGTTTG 240

TATCCAAAAT TTTAG 255

(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 231 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...231

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67

ATGGRCATGT CTCATATTAT TAAGAGCATT GAAGCTTTAG ATGACTATAC CATTAGATTC 60

ACGCTTAATG GGCCAGAAGC CCCGTTTTTA GCGAATTTGG GCATGGACTT TTTAAGCATT 120

TTGAGTAAGG ATTACGCTGA TTACTTGGCT CAAAATAATA AAAAAGACGA GTTGGCTAAA 180

AAMCCTGTTG GGACAGGGCC TTTCAAATTC TTTTTGTGGA ATAAAAGATG A 231 

(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 591 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...591

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68

TTGATGAGGA AAATTTTTTC TTATATTTCT AAGGTTCTAT TATTTATTGG GGTGGTTTAT 60

GCAGAGCCTG ATTCTAAAGT GGAAGCCTTA GAAGGGAGGA AGCAAGAGTC TTCTTTGGAT 120

AAAAAAATCC GCCAAGAATT GAAGAGTAAG GAATTGAAGA ATAAGGAATT AAAGAACAAG 180

GATTTGAAAA ATAAAGAAGA AAAGAAAGAA ACAAAAGCCA AGAGAAAACC CAGAGCAGAA 240

GTCCATCATG GGGACGCCAA AAATCCCACT CCAAAGATCA CGCCTCCTAA AATCAAAGGG 300

AGTAGTAAGG GCGTTCAAAA TCAAGGCGTT CAAAACAACG CGCCAAAACC TGAAGAAAAA 360

GATACAACCC CTCAAGCTAC TGAAAAAAAT AAGGAAACAA GCCCTAGCTC TCAATTCAAT 420

TCCATTTTTG GTAATCCTAA TAACGCTACC AACAACACCC TTGAAGATAA GGTCGTAGGG 480

GGCATTTCAT TGCTTGTTAA TGGTTCGCCT ATCACGCTGT ATCAAATCCA AGAAGAGCAA 540

GAAAAATCTA AAGTGAGYAA SGCTCRAGCT AGGGATCGTT TGAKTCKCTG A 591

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 540 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...540

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69 



[illegible] [illegible] [illegible] [illegible] TATTGGTAGT TGGCTGTAGT 60

CATAAAATGG ATAATAAGAC TGTGGCTGGC GATGTGAGCA CTAAAGCGGT TCAGACTGCG 120

CCTGTTACTA CAGAACCAGC TCCAGAGAAA GAAGAGCCTA AACAAGAGCC AGCTCCAGTG 180

GTTGAAGAAA AGCCGGCTAT TGAAAGCGGG ACTATCATCG CTTCTATTTA TTTTGATTTT 240

GACAAGTATG AGATCAAAGA ATCCGATCAA GAGACTTTAG ATGAGATCGT GCAAAAAGCT 300

AAAGAAAACC ACATGCAAGT GCTTTTGGAA GGCAATACCG ATGAATTTGG CTCTAGCGAA 360

TACAACCAAG CGCTTGGCGT TAAAAGGACT TTGAGCGTGA AAAACGCTTT AGTCATTAAA 420

GGGGTAGAAA AAGATATGAT CAAAACCATC AGTTTTGGCG AAAGCAAACC CAAATGCGTC 480

CAAAAAACTA GAGAATGTTA CAGAGAAAAC AGAAGAGTGG ATGTCAAATT AGTGAAGTAA 540

(2) INFORMATION FOR SEQ ID NO: 70:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 861 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...861

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70

ATGGGAACGC TCATTGAAAA ATGGTTTGGC TTCTCTCAAA TCAGAGAAGA ATTAGAAGCT 60

CGCATCAGTG AGTTAGAAGA CGAAAACACC GAATTGTTAA GAGAAAGAGA ATACTTAGCT 120

GCAGAAACTA GCGAGTTAAA AGACGCTAAC GATCAATTAC GGCAAAAAAA CGACAAGTTA 180

TTCATAACAA AAGACAAGCT AACCAAAGAA AACACCGAGT TATTCGCAGA AAACGAAAGC 240

TTATCTGTAA AAATCAGCGG GTTAGAACAC TCTAACGATC AATTATGGCA AAACAATAAC 300

AAGCTCACTA AAGAAAAAGC AGAACTGAAA ACGGAAAAAG ACATTTTAGC TAAAGAAAAC 360

ACACGCTTAT TAGCAGCCAG AGATCGGCTG ACTGAAGAAA AAAGAGAATT GACAACAGAA 420

AAAGAAAGGC TAAAAAGAGA AAACACCGAG CTAACCCATA AAATCACCGA GCTGACTAAA 480

GAAAATAAAG CACTAACCAC CGAAAACGAC AAGCTCAACC ACCAAGTTAC CGCGCTCACT 540

AATGAGCGAG ATAGTCTCGA ACAAGAGCGA GCGCGATTGC AAGATGCGCA TGGGTTTCTA 600

GAAAAACGAT GCACCAATTT AGAGAAAGAA AACCAACGCC TAACTGACAA GCTCAAACAA 660

TTAGAAAGCG CTCAAAAAAG CTTGGAAAAC ACTAACAATC AATTACGGCA AGCTTTAGAA 720

AACTCTAATG TCCAATTAGC ACAAGCTAAA GARARAATWG CCATAGAGRA AAGCGAGCTG 780

GMGCGAAGAA ATCGCACGCT TGAAGAGCTT AGAGGGTATG GAAGCCAAAA GSCGATCTGG 840

ACTTACACAW CAGGCGTTTA G 861



(2) INFORMATION FOR SEQ ID NO: 71:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 333 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...333

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71 

GTGTTGCGCA AGCTTTTGGG TAAAAATTGC ATAGAAACGC ATAAGGGGGT GGGCTATCGC 60

TTAACCCACT ATGAAAAAAA ATCCCTCAAA CTCTTTTTAG GGACTTATTT AGGCTCTTCG 120

T^TGTGTTAA TGCTAGTGAT TAGCGTTTTA GCGTTTAACT ATGAAAAAAA CGAAAAAATC 180

AAARTGATAC GCATGGACAT GGACAAAATG GCTTCTAAGA TCGCTAGTGA AATTATCCAA 240

TTGCACATGC AAACGCATGC GGATTATCAC AACGCTTTAA ACGCCCTGAT TTCACGTTAT 300

AAAGACGTTT CCATAGYCCT YTYYGAYACG TAA 333 



(2) INFORMATION FOR SEQ ID NO: 72: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...375

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 

TTGATGACCA AAAGCTTAAA ACTCATTCAA AAAGGGGTTA AAAACCTCTA TGAAACCCTT 60

AAAAATAGGG CTTTAGAGCA TCAAGACACG CTAATGGTGG GCAGAAGCCA TGGGGTGTTT 120

GGCGAACCCA TCACCTTTGG TTTAGTTTTA GCTCTTTTTG CTGATGAAAT CAAACGGCAT 180

TTAAAAGCCC TGGATTTAAC GATGGAATTT ATCRGCGTWG GGGCGATCAG TGGGGCTATG 240

GGGAATTTCG CGCACGCCCC TTTAGAATTA GAAGAATTAG CGTGCGGATT TTTAGGCTTA 300

AAAACCGCTA ATATCAGCAA TCAAGTCATT CAAAGAGACC GCTACGCAGG CTTGCATGCG 360

ATCTGGCTCT TTTAG 375



(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...288

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 



TTGTCAGACG CTTCAAAAAG ATCCCTTAAT CCAACCTTAA TGATGAATAA TAATAATACC 60

CTACCCAAAC CCCTAGAAGA AAGCCTAGAT TTAAAAGAGT TTATCGCTCT TTTTAAAACC 120

TTTTTCGCAA AAGAAAGAGG TTCTATTGCT TTAGAAAACG ATCTCAAACA GGCTTTCACT 180

TATTTAAATG AAGTGGATGC GATCGGTTTG CCTGCCCCCM AAAAGCGTGA AAGAAAGCGA 240

TCTTATTGTT GTCAAACTCA CCAAATTAGG GACGCTCCAT TTAGATGA 288



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...243

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 



TTGCCTATTA TTTTAYCTGT AATCGTGATG ATGTTTTTTT CCAAAATCGT TGGCGATTTT 60

ATTGAAAAGC ATTATCGCGT CAAAACTTTA GCCTTTGTGT TTTTGCTCGT TGTGGGCGTG 120

TTTTTGTTTT TAGAAGGCTT GCATTTACAC ATCAATAAAA ACTATTTGTA TGCGGGTATT 180

GGTTTTGCCT TGCTCATAGA ATGCTTGRAT ATTTTCATAG AAAAGAAAAT GAAAAAAAGT 240

TAA 243



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...798

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75

ATGATAAAAG CGCGGTTTAA AAAACGCCTT TTAGGATCTA GGGGCGCGTT TGATTTGAAT 60

ATAGACTTAG AAATTAAAGA AGCAGAAGTT GTCGCTTTAT TAGGAGAATC GGGAGCGGGT 120

AAAAGCACGA TCTTACGCAT TTTAGCAGGG CTTGAAGCGG TGAGTAGCGG CTATATTGAA 180

GCCAATCATT CAGTATGGTT AGACACTCAA AAAAAGATTT TTTTAAAACC ACAACAGCGA 240

AAAATCGGCT TTGTGTTTCA AGATTACGCC CTATTCCCTC ATTTAAACGT GTATCAAAAC 300

ATCGCCTTTG CTCACCCTAA AGATAAAAAT AAAATCCACG AAGTGTTACG CTTAATGCGT 360

TTAGAAAACC TAAGCCAGCA AAAAATTCCC AAACTCTCTG GCGGGCAAGC CCAACGAGTC 420

GCTTTAGCAA GAGCTTTAAT CGCAGCCAAA AATCTATTGC TTTTAGATGA GCCTTTAAAC 480

GCCCTAGATA ACGCCTTAAA AAACGAGGTG CAACAAGGTT TGCTTGATTT TATCAAGCGT 540

GAAAATTTAA GCGTGTTATT GGTAAGTCAT GATCCAAACG AAATAACCAA ACTCGCGCGA 600

ACTTTCCTCT TTTTAAACAA TGGCGTTATT GATCCTAATC AAGAAAATCG GCTTTTTTCA 660

AACCGCTTAT TGGTAAAACC TCTCTTTGAA GATGAAAATT ATTGCCATTA TGAGGTCATT 720

CCTCAAACGA TCAGTTTGCC CAAAGATTGT CTGAACCCAA CTTTTAAGCT TGATTTCATT 780

CAAAACAAAA AATTTTAG 798

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 195 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...195

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76

GTGAAATTCA GCGTTTTAAC CCTTTTCCCG CAACTCATCT TGCCTTATTT TGAAGATTCT 60

ATTTTAAAAA GAGCGTTAGA AAAAAACCTT TTTGAATTGG AAGTGTTAAA CCTTAGAGAT 120

TTTAGCGCTA ACAAATATCA AAAAGCGGAK TCACACGCTC ATTGGTGGGG GTGCGGGGCA 180

AATTTTAGAC CCTGA 195

(2) INFORMATION FOR SEQ ID NO: 77:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 414 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...414

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77

TTGTGGCGCA CACCGAAGAC RCCCTTAGTC ATTAAACCCT ATTTGAAAAG CATGAGCGAT 60

TCAGAGATYT TTGCGGYCAY GTGCGTGGGY ATGGCYAGCG TTRCGGGGCC TGTGTTAGCC 120

GGGTATGCGA GCATGGGCAT TCCTTTACCT TATTTAATCG CCGCATCGTT TATGTCCGCT 180

CCTGGGGGGT TGCTGTTCGC TAAAACCATT TACCCGCAAA ACGAAACCAT TTCTAGCCAT 240

GCAGATGTTT CTGCAGAAGA GCATGTCAAT ATTATAGAAG CTAYCGCWMA TGGGGCAAGC 300

ACAGGGGYTC ATTTAGCCTT GCATGTGGGG GCGATGCTTT TAGCCTTTGT GGGGATGGTC 360

GCGCTCGTTA ACGGGCTTTT AGGGGTTGTA GGGGGATTTT TAGGCATGGA GCAT 414 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 348 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...348

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78

GTGATGAACT TTTTTGTGGG CGGACTTTCC ATTGTTTGTA ATGTGGTGGT CATCACTTAC 60

TCCGCGCTCC ACCCTACAGC CCCTGTAGAA GGTGCAGAAG ATATTGTTCA AGTATCGCAC 120

CATTTGACCA GTTTCTATGG GCCAGCGACT GGGTTATTGT TTGGKTTTAC CTACTTGTAT 180

GCCGCTATCA ACCACACTTT TGGTTTGGAT TGGAGACCCT ATTCTTGGTA TAGCTTATTC 240

GTAGCGATCA ACACTGTTCC TGCTGCGATT TTATCCCACT ATAGCGATAT GCTTGATGAC 300

CACAAAGTGT TAGGCATCAC TGAAGGCGAT TGGTGGGCAA TCATATKG 348

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...684

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79

GTGCTTTTGG GCAAGCATAG TGGGGCGGGG TTGTTGAGCG CTTTAWGCGC GTTAAGTTTT 60

GGATCTGGGG TGGTGAGTAT CCAAGCGTTA GAGTGCGAGA TAACTTCTAA TAACAAGCCT 120

TTAGAATTGG TTTTTTGTGA AAATTTCCCT AAAAAGCTCA GCGCGTTCGC TCTTGGCATG 180

GGGTTAGAAA ATATTCCAAA GGATTTTAAG AAGTGGCTTG AATTAGCCCC ATGCGTTTTA 240

GATGCGGGCG TTTTTTATCA TAAAGAAGTG TTACAAGCCT TAGAAAAAGA AGTGATCTTA 300

ACCCCTCACC CTAAAGAGTT TTTATCGTTA TTGAAATCAG TGGGGATCAA TATAAGCATG 360

CTAGAATTAC TAGACAATAA ACTAGAAATC GCAAGGGATT TTTCTCAAAA ATACCCCAAG 420

GTGGTTTTGC TTTTAAAGGG GGCTAATACC CTAATCGCTC ATCAAGGGCG GGTTTTTATC 480

AACAATTTAG GGAGCGTGGC TTTRGCCAAA GCAGGCAGTG GCGATGTGTT AGCGGGGCTG 540

ATTGTAAGCC TACTTTCTCA AAACTACACG CCTTTAGRCG CCGCSATTAA CGCAAGTTTG 600

GCGCACGCCC TAGCGGGTTT ARAATTTAAG AATMATTAMG CTTTAACGCC CSTAGATTTG 660

ATAGAAAAGR TCAAACGACT ATAA 684

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 328 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...328

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80

GTGCYT^TAT ACCTAGCACT AACCTTGAGT TTAGGCATTG CTATGCTTTT AGTGGAAATG 60

CTGATTGGAA ATTTGGGTAA AAAAGACGTT GTTTCCAATT ATCAAATCTT AGATCCTAAA 120

AGGAAAAAAT ATTACCCTTT CACTTCTTTT TTTATTTTAG GCGGCCCTCT CATTCTATCT 180

TTTTATGCGG TGGTGTTAGG CTGGGTGCTT TACTATCTTT TTGTAGTAAC TTTTGATTTG 240

CCTAAAGATT TAGMGCAGGC TAAAATGCAR TTCMGMATGC TTCAAAATGG CAGTTTGATC 300

TGGCCTGTTA TTGACTTTAG CGCATGCT 328

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...294

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81

TTGACAACAA AAGCGTGTTG GTTGCTTCGG GTTTGTTGTT ATAGAAGTCT AAATATTACA 60

ATCAAGGATA GAACGATGAA AACCAATGGT CATTTTAAGG ATTTTGCATG GAAAAAATGC 120

TTTTTAGGCG CGAGCGTGGT GGCTTTATTA GTGGGGTGTA GCCCGCATAT TATTGAAACC 180

AATGAAGTTG CTTTGAAATT GAATTACCAT CCAGCTAGCG AGAAAGTTCA AGCGTTAGAT 240

GAAAAGATTT TACTTTTAAG GCCAGCTTTC CAATACAGCG AKAATATTTG CTAA 294

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...438

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82

TTGAGTGAGT GGCAAACATT TTGTTTAAAA GATTTAGGGA AAATAGTCGG CGGCGCTACC 60

CCACCTACCA ATAACCCCAA AAATTATGGC AATAAAATTG CTTGGAT'i AC CCCtAaauAt 120

TTATCCACTT TACAAGGGCG CTACATTAAA AAAGGCAGCC GCAGCATTTC ACGATTAGGG 180

TTTAAATCAT GCTCTTGTGT GTTGCTCCCA AAGCATGCCA TTTTATTTTC TTCAAGAGCT 240

CCCATAGGTT ATGTGGCAAT TGCTGAAAAA AGGCTATGCA CCAATCAAGG TTTTAAAAGT 300

ATTATCCCTA ACAAAAAAAT TTATTTTGAA TTTTTATATT ACTTATTAAA ATACTATAAG 360

GATAACATTT CCAACATAGG GGGCGGAACT ACTTTTAAAG AAGTTTCAGG GGCTACTTTA 420

GGKTCTATTC CAAGTTAA 438



(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 822 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...822

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 



ATGGAATTTA TGAAAAAGTT TGTAGCTTTA GGGCTTCTAT CCGCGGTTTT AAGCTCTTCG 60

TTGTTAGCCG AAGGTGATGG TGTTTATATA GGGACTAATT ATCAGCTTGG ACAAGCCCGT 120

TTGAATAGCA ATATTTATAA TACAGGGGAT TGCACAGGGA GTGTTGTAGG TTGCCCCCCA 180

GGTCTTACCG CTAATAAGCA TAATCCAGGA GGCACCAATA TCAATTGGCA CTCCAAATAC 240

GCTAATGGGG CTTTGAATGG TTTTGGGTTG AATGTGGGTT ATAAGAAATT CTTCCAATTC 300

AAGTCGCTAG ATATGACAAG CAAGTGGTTT GGTTTTAGAG TGTATGGGCT TTTTGATTAC 360

GGGCATGCCG ATTTAGGTAA ACAAGTTTAT GCACCTAATA AAATCCAGTT GGATATGGTC 420

TCTTGGGGTG TGGGGAGCGA TTTGTTAGCT GATATTATTG ATAAAGACAA CGCTTCTTTT 480

GGTATTTTTG GTGGGGTCGC TATCGGCGGT AACACTTGGA AAAGCTCTGC AGCAAACTAT 540

TGGAAAGAGC AAATCATTGA AGCCAAAGGT CCTGATGTTT GTACCCCTAC TTATTGTAAC 600

CCTAATGCCC CTTATAGCAC CAACACTTCA ACCGTCGCTT TTCAAGTGTG GTTGAATTTT 660

GGGGTGAGAG CCAATATCTA CAAGCATAAT GGCGTGGAAT TTGGCGTGAG AGTGCCGCTA 720

CTCATCAATA AATTTTTGAG CGCGGGTCCT AACGCTACTA ACCTTTATTA CCATTTGAAA 780

CGGGATTATT CGCTTTATTT GGGGTATAAC TACACTTTTT AA 822

(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 447 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...447

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 

[illegible in source] 60

ATCCCGGCAG GCATTATTGC GTTTTCTTTT AATGTGGGGG CATACGCTTC AGAGACTTTG 120 

AGGGCGAGCT TTCTTTCTGT CCCTAAAGAT CAATGGGATT CAAGCTTGAG TTTGGGCTTG 180 

AATTACTTGC AAACCTTTTG GCATGTCATC TTTTTTCAAG CGCTCAAAGT CGCCACGCCA 240

AGCCTAAGTA ACACTTTCAT CAGCCTTTTT AAAGAAACTT CTTTAGCGTC TGTGGTCACT 300

ATCGCAGAGG KTTTTAGAAT CGCACAGCAA AAAGYGAACG TCAGCTATGA CTTTYYGCCT 360

ATTTATTTGG AAGKCGCTTT GATTTACTGG CTTTTTTGCT TGGTTTTAGA AGTGATTCAA 420

AAGCGCGTGG AAAAAATCTT AAATTAA 447

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...405

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85

GTGGTGGCTG ATGAAGTTAG GAATTTAGCT GGGCGCACTC AAAAGTCTTT AGCCGAAATC 60

AATTCCACTA TCATGGTGAT TGTCCAAGAA ATCAATGATG TGAGTTCGCA AATGAATCTC 120

AATTCGCAAA AAATGGAGCG CTTGAGCGAT ATGAGTAAAA GCGTGCAAGA AACTTACGAA 180

AAAATGAGTT CTAATTTAAG CTCAGTCGTT TTAGACAGCA ATCAAAGCAT GGACGATTAC 240

GCTAAATCCG GACACCAAAT TGAAGCTATG GTAAGCGATT TTGCAGAAGT GGAAAAAGTG 300

GCTTCTAAGA CTTTGGCTGA TTCTTCAGAT ATTTTAAACA TCGCTACGCA TGTGAGTGGA 360

ACGACCATGA ATTTAKACAA ACAAGTGAAT TTGTTTAAAA CTTAA 405

(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...402

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86

ATGAATTACG ATAACTATTG GGATGAGGAC AAACCAGAAC TCAATATCAC GCCTTTAGTG 60

GATGTGATGC TTGTTTTATT GGCTATTCTT ATGGTAACGA CGCCCACTCT CACCTATAAA 120

GAAGAGATTG CCTTGCCTTC TGGTTCAAAA ACTGCTAGAG CCACTCAAGA TAAAGTGATA 180

GAGATACGCA TGGATAAAGA CGCAAAAATC TATATAGATA GTCAAACCTA TGAATACAMC 240

TCTTTCCCGG ACACTTTCAA TTTGCTTTCT AAAAAATACG ATAAAGATAC TAGGGTGAGT 300

ATCCGTGCGG ACAAGCGATT GACCTATGAC AAAGTGATTT ATTTGTTAAA AACGATTAAA 360

GAAGCGGGGT TTTTGAAAGT TTCTTTAATC ACAAGTCCTT AA 402

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...216 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87 



ATGCCACCCA CACSCCCCCA AGCGAGTATT TTAAGGCTAA CCCTAAAAAA CCCTTTGMGC 60

MTGCTATCTC GTTATTCGCT CTGTCTGTTG AAAAAAACGC GCTTGCAAAC AACATCAAAC 120

AGCGCACCAA AAGCATGCTT GATTGCGGGC TTATTGAAGA AATCAAAGCC CTTTATATTA 180

AATACCCTAA AGATTCGCAG CCTTTTAAAG CCATAG 216

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...654

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88

ATGCCTGTTA TAAGAGTTTT AGTAATGCTT GCAACAATGA TGATGAAATT AGTAAAAACG 60

GCAAAAGAAA AGAAAGTTTT TAAGAATGTG GGAATATCTA TAATGGGGAT TGCTTTTTGG 120

GAAGCGA7AA AAGACTCGAT AAAAAAACAA ATTAAAAAAA GCGATTGGAT ATGCGGGAAT 180

GTTAAGACTG CGGATGATTA TTTAAAAACG CATCCTAACT CATGGTTTAA TTCAGCAATA 240

GGTGTAACAG CGATAACAGC CATGCTTATG AATGTGTGTT TTGCTGATGA CCAATCCAAA 300

AAAGAAGTGG CTCAAGCTCA AAAGGAAGCT GAAAACGCTA GGGATAGAGC GAACAAGAGT 360

GGGATAGAAC TGGAACAAGA AGAGCAAAAG ACAGAACAAG AAAAACAAAA GACAGAACAA 420

GAAAAACAAA AGACAGAACA AGAAAAACAA AAGACAGAAC AAGAAAAACA AAAGACAGAA 480

CAAGAAAAAC AAAAGACAAG CAATATAGAG ACTAACAATC AAATAAAAGT AGAACAAGAA 540

CAACAAAAGA CAGAACAGGA AAAACAAAAG ACAAACAATA CGCAAAAAGA TTTGGTTAAC 600

AAAGCAGAAC AAAATTGCCA AGAAAATCAT AATCAATTCT TTATTAAAAA TTAA 654



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...228

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89



ATGGTTATTT CTGGGCATTT CACCACTTAT AGCTATATTG AGCCTTTTAT CATTCAAATC 60 

AGCCAATTTT CTCCTGACAT TACAACGCTA ATGTTGTTTG TGTTTGGGTT AGCAGGCGTG 120 

GTGGGGAGTT TTTTGTTCGG CCGTTTGTAT GCGAAAAATT CAAGAAAATT TATCGCTTTT 180 

GCAATGGTTT TAGTCATTTG CCCGCAACCT CTTGCTTTTT GTGTTTAA 228



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...576

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 



ATGAAATCTA CAAGAATTGG TTCTAAAATT GTCATGATGG TGTGTGCGGT TGTTATTGTC 60 

ATTAGCGCTG TTATGGGCGT TATTATCAGC TACAAGGTTG AAAGCGTGTT GCAAAGCCAA 120 

GCCACAGAAT TGCTGCAAAA AAAAGCTCAG TTAGTCAGTT TTAAAATTCA AGGCATTATG 180 

AAGCGCATTT TTATGGGCGC TAACACCCTT GAAAGGTTTT TAAGCGATGA AAATGGCGCT 240

ATTAATGACA CCCTAAAAAG ACGCATGCTC TCTGAGTTTT TGTTAGCAAA CCCTCATGTG 300

TTATTGGTTA GCGCGATTTA TACGAATAAT AATGAACGAA TGATCACTGC AATGAACATG 360

GATTCAAAAA TCGCCTACCC TAATACCGCA CTCAATGAAA ACATGACCMA CCCAATCCAT 420

TCGCTCAAAA GTATAACCCG TTCARATCCC TATTATAAAG AGGTTAATGM CRATAAAATC 480

TATRRCATRR ACATTACCCT CCCCCTAATR RRCAARAATY AAAATRTTAT ARRCRCWCTW 540

AATTTCKTTT TAAACATTGA CWGCTTTTTA TACTGM 576

(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 762 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...762

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91

ATGGCATACA AATATGATAG AGACTTGGAA TTTTTAAAGC AACTGGAATC TAGTGATTTA 60

TTGGATTTGT TCGAGGTGCT TGTTTTTGGT AAAGACGGCG AAAAAAGACA CAATGAAAAA 120

CTCACAAGCT CCATAGAATA CAAAAGGCAT GGCGATGATT ACGCTAAATA CGCAGAAAGA 180

ATCGCTGAAG AGTTGCAATA CTATGGGAGC AATAGTTTTG CGAGTTTCAT TAAAGGTGAA 240

GGAGTCTTAT ACAAAGAGAT TTTATGCGAT GTGTGCGATA AATTAAAGGT CAATTACAAC 300

AAGAAAACTG AAACGACTTT AATTGAACAA AACATGCTTT CTAAAATCTT AGAAAGAAGC 360

CTAGAAGAAA TGGATGATGA AGAAGTGAAA GAAATGTGCG ATGAATTGTC CATAAAAAAC 420

ACGGACAATT TGAACAGACA AGCCTTAAGC GCGGCGACTT TAACGCTGTT TAAAATGGGA 480

GGCTTTAAAT CTTATCAATT AGCTGTCATT GTTGCGAATG CGGTTGCAAA AACCATTCTA 540

GGGCGTGGTT TATCGCTTGC GGGCAATCAA GTGCTTACAA GAACTCTGAG CTTTTTAACA 600

GGCCCTGTTG GCTGGATCAT TACAGGCGTA TGGACAGCGA TTGATATTGC AGGGCCGGCT 660

TATAGGGTAA CCATACCGGC ATGCATTGTG GTCGCCACTT TACGCCTAAA AACGCAACAA 720

GCCAATGAAG ATAAGAAGTC GTTGCAAATA GAATCCGTTT AA 762

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...882

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 

TTGTTGCTTT TTATCGTTGT GATCACCTCT TTGGTTAAAA ACACGATCCC AAATATTTGG 60

CTCAC^AAAA TCCTTTATAT GGCTATCTTG CTTTGCGCGA TCGCTCATTC TGTGGGGCYA 120

ATCTTGCGYT GGTATGTGAG TGGGCATTCG CCTTGGAGTA ACGCTTATGA GTCCATGTTC 180

TATATCGCAT GGGCTTCTGT TATCGCAGGG TTTGTTTTAC GAMCTAAACT CGCGCTATCG 240

GCTTCTAGCT TTTTGGCCGG TATCGCGCTC TTTGTGGCTC ATTTAGGCTT TATGGACCCT 300

CAAATTGGCC CTTTAGTGCC GGTGTTAAAA TCCTATTGGC TCAATATCCA TGTCTCTGTC 360

ATCACCGCTA GTTATGGCTT TTTGGGCTTG TGTTTTGTGC TAGGGATTTT AAGTTTGGTT 420

TTGTTTATTT TGCGCAAACA AGGGCGTTTC AATTTAGACA AAACCATTCT TTCCATTAGC 480

GCTATCAATG AAATGAGCAT GATTTTAGGC CTGTTCATGC TCACAGCCGG GAATTTCTTA 540

GGTGGGGTGT GGGCGAATGA ATCTTGGGGG CGCTATTGGG GGTGGGACCC TAAAGAAACT 600

TGGGCGTTGA TTTCTATTTG CGTCTATGCC TTGATCTTGC ATTTGCGTTT TTTAGGCTCT 660

CAAAATTGGC CCTTTATTTT AGCGAGCAGC AGCGTGCTAG GGTTTTATTC GGTTTTAATG 720

ACTTTATTTT GGCGTGAATT ACTACCTTTC TGGCTTGCAC AGCTATGCCG CAGGKSATCS 780

TTTGCCGATC CCTACTTTTT TATACTTTTT GGTAGCGATA CCTTTCGCTC TCGTATCTTG 840

GCGTATTTCA AACGTCATTT GAGTTTGCCT AAATTGGTTT AA 882

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 429 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...429

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93

GTGGAAATGA TCCACACGCA AGATTACATT AAAATGGAAG AAGCCGCCAC TGAGGCGATT 60

AAGCGTAAGG AATCTTCCAT TTACTTGGGC ATGGATATTT TAAAAAATGG GGCTGACGCT 120

TTGATTTCAG CGGGGCATAG CGGGGCGACT ATGGGTTTAG CGACCTTGCG TTTAGGGCGT 180

ATCAAGGGGG TTGAAAGGCC TGCTATTTGC ACTTTAATGC CTAGCGTTGG CAAACGCCCT 240

AGCGTGCTAT TAGACGCAGG AGCGAACACG GATTGCAAGC CTGAATATTT GATTGATTTT 300

GCTCTCATGG GGTATGAATA CGCTAAAAGC GTGTTGCATT ATGACAGCCC TAAAGTGGGT 360

CTTTTGAGTA ATGGTGAAGA AGATATTAAG GGGGGAATAC GCTCGTTAAA GAAACGCATA 420

AAATGTTGA 429

(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 433 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...433

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94

ATGCTAGAAA TCAAGAATTT AAACTGCGTT TTAAACTCTC ATTTTTCGCT CCAAAACATC 60

AATATTTCTT TAAGTTATAG TGAAAGGGTG GCGATCGTGG GCGAAAGCGG GAGCGGGAAA 120

AGCTCTATCG CTAATCTCGT CATGCGATTA AACCCTAGAT TCAAGTCCCA TAATGGCGAA 180 

ATCCTGTTTG AAACAACCAA TCTTTTAAAG GAAAGCGAAG CGTTYMTGCA GCATTTAAGG 240

GGGAATATTA TCGCTTACAT CGCCCAAGAC CCCCTATCCA GCCTCAACCC CTTGCATAAA 300

ATCGGCAAGC AAATGAGTGA AGCCTATTTT TTACACCATA AAAACGCTTC TCAAGTGTCT 360

CTTAATGAAC AAGTTTTAAA CGTTATGAAA CAAGTTCAAT TGGATGAAAA TTTTTGGAAT 420

GTTTCTCTTA TGT 433

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...252

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95

ATGAACTACA AAGTTGCATC TGCTAGAAAT ATCGCAACGC TTCTTTTCTT ATTCTTTTCT 60

CAAAGTGAAG CTTTTGATTT GGGTAAAATC GCTAAAATCA AAGCGGGTGC TGAAAGTTTC 120

TCTAAAGTCG GTTTCAATAA CAAACCTATC AACAMTAATA AAGGGATTTA CCCTACCGAA 180 

ACCTTTATGA CGATTAATGG CTTACATGCA GGTGGATTTT ACGGAGCTCT TGCCCAAAAG 240 

CGCTACGGCT AA 252 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 393 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...393

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96

ATGGACGCTT TAGAAATCAC CCAAAAACTC ATCAGTTACC CTACCATTAC GCCCAAAGAA 60

TGCGGTATTT TTGAATACAT TAAATCGCTT TTTCCTGCTT TTAAAACCCT AGAATGTGAA 120

AAAAATGGCG TGAAAAACCT TTTTTTATAC CGCATTTTTA ACCCCCTCAA AAAGCATGCA 180

GAAAAAGAAC ATGCAAAAGA AAAGCATGTR AAAGAAAATG TTAWGCCCTT GCATTTTTGC 240

TTKGCAGGGC ATATTGRTGT CGTGCCTCCT GGGRRCAWTK GGCRRRSKGA TTCCTTTWWA 300

YCCATCATTA AAGAGGGGTT TTTATACGGT CGTGGGGCGC AAGACATGAA GGGGGGCGTT 360

GGGSSGTTTT TKAGGTGCRR GTTWAAATTT TAA 393

(2) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1023 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1023

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97

ATGATTTTAA GCATTGAAAG TTCTTGCGAT GACAGCTCTT TAGCCCTTAC AAGAATAGAG 60

GACGCCAAGC TCATCGCTCA TTTTAAAATC TCTCAAGAAA AGCACCACAG CTCTTATGGG 120

GGCGTTGTGC CTGAGATTGC ATCGCGCCTG CATGCTGAGA ATTTGCCGCT TTTATTAGAA 180

CGCGTTAAAA TAAGCTTGAA TAAGGATTTT TCCAAAATTA AAGCCATCGC TATCACTAAT 240

CAGCCAGGTT TGAGCGTTAC TTTAATAGAG GGTTTGATGA TGGCAAAAGC CTTGAGCTTG 300

TCTTTGAATT TACCCTTGAT TTTGGAAGAT CATTTGAGAG GGCATGTGTA TTCGCTCTTT 360

ATCAATGAAA AACAAACCCG CATGCCTTTA AGCGTGCTGC TAGTCTCTGG GGGGCATTCT 420

TTAATTTTAG AGGCTAGAGA TTATGAAGAC ATTAAAATCG TTGCCACGAG TTTAGACGAT 480

AGCTTTGGGG AGAGTTTTGA TAAGGTTTCA AAAATGCTTG ATTTAGGCTA TCCAGGAGGC 540

CCCATAGTGG AAAAATTAGC CCTTGATTAT GCACACCCAA ACGAGCCTTT AATGTTCCCT 600

ATCCCTTTAA AAAACAGCCC GAATTTGGCT TTTAGTTTTT CAGGTTTAAA AAATGCGGTG 660

CGTTTGGAGG TTGAAAAAAA CGCCCATAAT TTGAACGATG AGGTAAAACA AAAGATTGGC 720

TATCATTTTC AAAGCGCGGC TATCGAGCAT TTAATCCAGC AGACTAAACG CTATTTTAAA 780

ATCAAACGCC CTAAAATTTT TGGCATTGTG GGGGGAGCGA GCCAAAATCT AGCCTTAAGA 840

AAGGCGTTTG AGGATTTGTG TGCTGAGTTT GATTGCGAGC TTGTTTTAGC CCCTTTAGAA 900

TTTTGCAGCG ACAATGCCGC CATGATAGGG CGATCAAGCC TAGAAGCTTA TCAAAAAAAG 960

CGCTTTATCC CTTTAGAAAA AGCCGATATT TCGCCAAGAA CGCTGTTAAA AAATTTTGAG 1020

TGA 1023 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME / KEY : misc_feature 
<B) LOCATION 1...507 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 



ATGTTATCTT CTAATGATTT GTTTATGGTC GTTTTAGGGG CGATTTTATT GGTGTTGGTG 60

TGCTTGGTGG GGTATTTGTA TCTTAAAGAA AAAGAGTTTT ACCATAAAAT GAGGCGTTTA 120

GAAAAAACTT TAGATGAATC CTATCAAGAA AATTATCTCT ATTCTAAGCG TTTGAGAGAA 180

TTAGAGGGGC GTTTGGAAGG CCTTTCTTTA GAAAAAAGCG CTAAAGAGGA CAGCTCATTA 240

AAAACGACTC TTTCGCACCT TTATAACCAG TTGCAAGAAA TCCAAAAATC CATGGATAAA 300

GAGCGCG/^T^ ACTTAGAAGA AAAAATCATT MYTTRGAAAA CAAWTTTWAA GACATGGGGC 360

ATTATGCCGC TAGCGATGAA GTCAACGGAA AAACAGGTTT TGAAAATGTA TCAAGAAGGC 420

TATAGCGTGG ATTCTATTTC TAAAGAATTT AAAGTGAGTA AGGGCGAGGT GGAATTTATA 480

TTGAACATGG CAGGGTTAAA ATGGTAG 507 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 366 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...366

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99 

TTGGATCCCT TTAGCCATAA GGAGAATTTT TTAGCCGTTG AAACCTTTAA AATGCTAGGC 60 

AAAACAGAAA GTAAAGACAA TCTTAATTGG ATGATCGCTT TGATCATTGA AAAAGACAAG 120 

GTCTATGAGC AAGTGGGATC GGTGCGTTTT GTGGTGGTTG TAGCGAGTGC TATCATGGTG 180 

TTAGCCTTAA TCATAGCGAT CACTCTTTTA ATGCGAGCGA TCGTGAGCAA TCGTTTGGAA 240

GTCGTTTC^A GCACC^TGTC TCATTTCTTT AAATTATTGA ACAATCAAKC CCATTCTAGC 300

RACAYTAAAT TGGTTRAAGC GCGATCTAAT GACGAATTAG GGCGCAYGCA AACASCTGAT 360

YAATAA 366 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...450

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100

ATGGAAT^TT ATCAAGTCTA TGACCCATTA GGCCATATTT GGCTGAGCGC TTTAGTCGCA 60 

CTTTCGCCTA TTGCGCTCTT TTTTATCTCT CTTATTGTCT TTAAACTTAA AGGGTATAGC 120 

GCTGGG^TT TAAGCTTAGC GCTTTCAATC CTTATTGCGT TATTTGTGTA TAAAATGCCT 180 

GTTCAAATGG TGAGCGCGAG TTTTTTCTAT GGCTTTCTTT ATGGCTTGTG GCCGATCGCA 240 

TGGATTGTGA TCGCTGCGAT TTTTCTTTAC AACCTTTCAG TGAAGTCCGG GTATTTTGAG 300

AT^TTAAAAG AAAGCATTTT AAGCTTGACG CCGGATCATC GCATTTTAGT GATTTTGATC 360

GGGTTTTGTT TTGGCTCGTT TTTAGMMGGC GCGRTTGGTT TTGGAGGCCC GGTAGCRATC 420

ACAGCGGCGA TTTTAGTGGC CTTGGGCTAA 450 

(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 978 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...978



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 



ATGAAAAGAA TTTTAGTTTC TTTGGCTGTT TTGAGTCATA GCGCGCATGC TGTCAAAACT 60

CATAATTTGG AAAGGGTGGA AGCTTCAGGG GTGGCTAACG ATAAAGAAGC GCCTTTAAGC 120

TGGAGGAGCA AGGAAGTTAG AAATTATATG GGTTCTCGCA CGGTGATTTC TAACAAGCAA 180

CTCACTAAAA GCGCCAATCA AAGCATTGAA GAAGCTTTGC AAAATGTGCC AGGCGTGCAT 240

ATTAGAAACT CTACCGGTAT TGGAGCTGTG CCTAGCATTT CCATTAGGGG GTTTGGTGCT 300

GGAGGCCCAG GGCATTCTAA TACGGGAATG ATTCTAGTCA ATGGGATTCC TATTTATGTC 360

GCGCCCTATG TTGAAATTGG CACGGTTATT TTTCCTGTAA CCTTTCAGTC TGTGGATAGA 420

ATCAGCGTAA CTAAGGGTGG GGAGAGCGTG CGTTATGGCC CTAACGCTTT TGGCGGTGTG 480

ATCAACATCA TCACCAAAGG CATTCCTACC AATTGGGAAA GTCAGGTGAG CGAGAGGACC 540

ACTTTTTGGG GCAAGTCTGA AAACGGGGGC TTTTTCAATC AAAATTCTAA AAACATTGAT 600

AAAAGCTTAG TTAATAACAT GCTTTTTAAC ACCTATTTAA GAACGGGGGG TATGATGAAT 660

AAGCATTTTG GAATCCAAGC TCAAGTCAAT TGGCTCAAAG GGCAAGGGTT TAGATACAAC 720

AGCCCTACGG ATATTCAAAA TTACATGTTA GATTCATTGT ATCAAATCAA TGATAGCAAT 780

AAAATCACCG CTTTTTTTCA ATATTATAGT TATTTCTTGA CAGACCCTGG ATCTTTAGGC 840

ATAGCCGCTT ACAATCAAAA TCGTTTTCAA AACAACCGCC CCAATAACGA TAAAAGCGGG 900

AGAGCGAAGC GATGGGGAGC TGTGTATCAA AACTTTTTTG GGGACACGGA TAGGGTAGGG 960

GGGGGATTTC ACTTTTAG 978

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 759 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...759

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 



TTGCGTTCAA TTTCAAGGAT AAAGATGCTT TCAGTGTATG AAAAAGGGAA TGCCCTAGAC 60

AAAAGGGTGC TTGAAGAATG GCTTTTAAGC GAAGACATTT TAATGGAAAA CGCCGCTATG 120

GCTTTAGAAA GGGCGGTTTT ACAAAACGCT TCTTTGGGCG CTAAGGTCAT TATTCTTTGT 180

GGGAGTGGGG ATAATGGAGG TGATGGCTAT ACTCTAGCCA GGCGTTTAGT GGGGCGTTTT 240

AAAACGCTGG TCTTTGAAAT GAAATTAGCA AAAAGCCCCA TGTGCCAATT GCAAAAAGAA 300

AGGGCTAAAA AAGTAGGGGT AGTCATCAAA GCATGGGAAG AAAAGAATGA AGATTTAGAA 360

TGCGATGTGT TAGTAGATTG CGTGGTAGGG AGCGCTTTTA AGGGCGGATT AGAGCCGTTT 420

TTAGATTTTG AAAGCCTTTC TCAAAAAGCA CGCTTTAAAA TCGCTTGCGA CATTCCTAGC 480

GGGATAGATT CTAAAGGCAG GGTGGATAAG AGGGCGTTTA ARGSCGGATA CCGACTATCA 540

GCATGGGCGC TATTCAAGTC ATGCTTACTA AGCRATAARR CTAAARACTA TATARRRRAA 600

TTGAAARTRR RRCATTTARR RGTTTTTAAT CAAATTTATG AGATCCCAAC ARACACTTTT 660

TTACTMGAAA AAARCGATTT GAARCTGCCC TTAAGGGATA GAAAGAAACG CTCACAAAGG 720

CGATTACGGG CATGCGCATG TGCTTTTGGG CAAGCATAG 759



(2) INFORMATION FOR SEQ ID NO: 103: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...417

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 

ATGGCATTAG ACAAAAGGAT TTGGATGCAY TTTGATCTTT TGCCTTTTGT GTTTATCATC 60

CCCTTGTTGG TGGTTTCTTT TTTGTTGATT TTTGAGAGTA GTGCGGTTTT GAGCTTGAAG 120

CAAGGGGTTT ATTATGCCAT AGGGTTTCTT CTCTTTTGGG TAGTGTTTTT TATCCCTTTC 180

AGGAAACTCG ATCGGTGGCT CTTTGCGCTT TATTGGGCGT GCGTTATTTT ATTAGCGTTA 240

GTGGATTTTA TGGGATCGAG CAAGCTTGGA GCGCAGCGAT GGCTAGTCAT TCCTTTCACT 300

TCTATCACCT TACAGCCTAG CGAGCCTGTG AAAAATCGCY ATTCYTTTAT TGTTGGCGCA 360

TTTGAKYAAA ATYAACCCGA CCYCCTTTTA AGGGCTATGA TTGGGGCATG TTTTTAA 417



(2) INFORMATION FOR SEQ ID NO: 104: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...981

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104

GTGTTAATGG CGTTGARCGA TAAACGCTAC GGCTTAGAAG CAGGGATCAA GTATTTCACC 60 

ATGGGGGCGA TGGCGAGCGC GTTTTTTGCT ATGGGCGCGA TGGCTTTTTA CCTGCTTACA 120 

GGGAGCTTGA ATCTTGAAGT CATTACCCTA TACTTACACA CTGAGGGCAT CACAAACCCC 180 

ATGCTCTTTG CGATGGGCAC TATTTTTTTG ATTGGAGCGA TTGGCTTTAA GGTTTCTTTA 240

GTGCCTTTCC ATACCTGGAT GCCTGATGTG TATGAGGGYA ATAACCCAGT CTTTGCGAGC 300

TATATTTCCA TTGTGCCTAA AATCGCTGGC TTTGTGGTAG CGACTCGCCT TTTTGGGGCG 360

TTTATAGACA CTCATACCGC TTGGGTAGAA GACATTTTTT ATGTTTTGAT CCTTATGACT 420

ATCACCATCC CTAATTTCAT TGCTTTATGG CAAGAAGATG TCAAAAGGAT GCTCGCTTAT 480

AGTTCTATTT CGCATTCTGG GTTCGCTTTA GCGTGCGTGT TTATCCACAC TGAAGATAGC 540

CAACAAGCGA TGTTTGTTTA TTGGTTCATG TTTGCCTTCA CTTACATTGG GGCTTTTGGC 600

CTTTTATGGC TCTTAAAAAG CCGGGAAAAA ACATGGGATG AACGCTACGA TCACCCCTAT 660

TCTAAATTCA ACGGCCTTAT CAAAACCCAC CCCTTAGTGG CGATCTTGGG CGCTATTTTT 720

GTTTTTGGGC TTGCAGGGAT CCCGCCTTTT AGCGTGTTTT GGGGGAAATT TTTAGCCGTT 780

GAAAGCGCGT TAGAGAGCAA TCACATTCTT TTAGCGGTGG TGATGTTAGT TAATAGCGCG 840

GTGGCTGCGT TTTATTATTT CCGTTGGCTC GTGGCGATGT TTTTCAATAA GCCCTTACAA 900

ACCCAAAGCT ACGCCAAAAC GATATTTACA CCCAAAACGC CACCATGCCC ATTTATGCGG 960

TCATTATTGC CATGGCGTTA G 981 



(2) INFORMATION FOR SEQ ID NO: 105: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...723



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105 



ATGATAAACT CAAAGAAAAG CTTGAAAAAG GGCTTAAGGG GCTTTTTTAA AATTTTAAAG 60

GACAGAAATG GCGCACATTT TAGTTGCGGA GCGACTTCAG GGTTTGGGCT AGAAATCGCT 120

AAGGCGTTTT TACAAAAAAA CCATGTGGTT TTTGGCACAG GGAGGCGGCA AGAGAATTTA 180

CAAAAATTGC AGCTCGCTTA CCCTAAGCGT TTCATTCCCC TGTGTTTTGA TCTTCAAAAC 240

AAGCCTGAAA CTAAGCGAGC GATAGAAACT ATTTTTTCCA TGACGGATCG CATTGACGCT 300

TTAATCAATA ACGCCGGCTT AGCGCTAGGC TTGAACAAGG CTTATGAATG CGAGTTAGAC 360

GACTGGGAAG TCATGATAGA CACGAATATC AAGGGGTTGT TGCATCTCAC CCGCTTGATC 420

TTGCCCTCTA TGATAGAGCA TGACCAAGGG ACTATCATCA ATCTTGGTTC TATCGCTGGC 480

ACTTACGCCT ATCCTGGAGG GAAKGTCTAT GGAGCGAGCA AGGCGTTKGT GAAACAAYYY 540

TCTYYAAATT TGCGAGCGGA CGTGGCTGGC ACTAACACTA GAGGGAGAAG GTGGAACCCG 600

GGGTGTGTGG CGAAACCGAA AGTCAGCAGG GTGCGGGGTA AAGGCGATAA ACCAAAGCCC 660

AAATCCGGCT ATGAAAAACA CCCCTTACCC CAAACCACAA GACAAGGGCT AACATCGGGC 720

TAG 723

(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...615

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106



GTGAGCGGGG TGGTGTTAAG CAAATTTGAT AGCGATTCTA AAGGGGGTAT CGCCTTAGGC 60

ATCACTTATC AATTGGGCTT ACCCTTGCGT TTTATTGGGA GTGGGGAAAA AATCCCTGAT 120

TTAGACGTGT TTATGCCTGA AAGGATTGTG GGGCGTTTGA TGGGGGCTGG AGATATTATC 180

TCGCTCGCTG AAAAAACCGC CAGCGTTTTA AACCCTAATG AAGCCAAAGA TTTAAGCAAA 240

AAGCTCAAAA AAGGGCAATT CACTTTCAAC GATTTTTTAA ACCAAATTGA AAAAGTGAAA 300

AAATTAGGCT CTATGAGTTC TCTRATCTCT ATGATTCCAG GTTTAGGGAA TATGGCAAGC 360

GCGCTAAAAG ACACGGATTT AGAAAGTTCT TTAGAAGTGA AAAAAATCAA GGCCATGGTT 420

AATTCCATGA CGAAAAAAGA GCGAGAAAAC CCCGAGATTT TAAACGGCAG CCGAAGAAAA 480

AGGATCGCTT TAGGGARCGG CTTAGAAGYG YCTGAAATCA ATCGCATCAT CAAACGCTTT 540

GATCAGGCGA GCAAAATGGC GAAACGACTC ACGAATAAAA AGGGTATTAG CGATYTGATG 600

AATCTAWCGA KCTAG 615

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...279

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107 

GTGGAAAAAG CCCATCCGGA TGTGTTTAAC CTCTTGTTAC AGGTTTTAGA TGAGGGGCAT 60
TTAACCGATA GTAAGGGCGT GAGGGTGGAT TTCAAAAACA CGATTTTGAT TTTAACCAGC 120
AATGTGGCTA GCGGCGCGCT TTTAGAAGAG GATTTGAGTG AAGCCGATAA ACAAAAAGCG 180
ATCAAAGAGA GCCTGAGACA ATTCTTCAAG CCGGAATTTT TAAACCGCTT AGATGAAATC 240
ATCTCCTTTA ACGCCCTAGA TAGTCATGCT ATCATCTAA 279

(2) INFORMATION FOR SEQ ID NO: 108: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...246

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 

TTGGTGTTTT TAGACAGGCG TTTGATTGTG ATGGTTACGG ACTCTAAAGG GAGTCGTTAT 60

ATTAATGTGC ATATCTTATT CCGCCAAATC AGTTTGTATG CGCTGTTGAG CGTTGTGGGA 120

TCTTTATTGT TTTTAGGCGT TTCATTACTG GTTTTAAATA AAGAAATTAA AAACATTGAA 180 




AAACAGCATG CTTTAMTCAC TAAGGAATTT GAGAAAAAAA GAGAGACGAA TGAAMAGCTT 240

TCTTYG 246 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...702

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 

TTGAGTTTGA TGASTGTGTT AAATGCCAAA GAATGCGTTY CGCCCATAAC AAGAAGCGTT 60

AAGTATCATC AGCAAAGTGC SGAGATCAGA GCCTTGCAAT TACAAAGTTA CAAAATGGCG 120

AAAATGGCGC TAGACAATAA CCTTAAGCTC GTTAAAGACA AAAAGCCAGC CGTCATCTTG 180

GATTTAGATG AAACCGTTTT GAACACTTTT GATTATGCGG GCTATTTAGT CAAAAACTGC 240

ATTAAATACA CCCCAGAAAC TTGGGATAAA TTTGAAAAAG AAGGCTCTCT TACGCTCATT 300

CCTGGAGCGC TAGACTTTTT AGAATACGCT AATTCTAAGG GCGTTAAGAT TTTTTACATT 360

TCTAACCGCA CCCAAAAAAA TAAGGCATTC ACTTTAAAAA CGCTCAAAAG CTTTAAGCTC 420

CCCCAAGTGA GTGAAGAATC CGTTTTGTTA AAGGAAAAAG GCAAGCCTAA AGCCGTTAGG 480

CGGGAGTTAG TCGCTAAGGA TTATGCGATT GTTTTACAAG TGGGCGACAC TTTGCATGAT 540

TTTGACGCCA TTTTTGCTAA AGACGCTAAA AACAGCCAAG AACAACAAGC CAAAGTCTTG 600

CAAAACGCTC AAAAATTCGG CACAGAATGG ATCATTTTAC CCAACTCTCT TTATGGCACA 660

TGGGAAGATG GGCCTATAAA AGCATGGCAA AATAAAAAAT AA 702 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 258 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...258

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 

ATGTTGGCGG CTGGTTTGAC TTTGCCTGAA TTTGGGTGTT ATTTAAGCCA TTATCTTTTA 60

TGGAAAGAAT GCGTCAAATT AGATCAACCG GTCGTTATTT TAGAAGATGA TGTAACGCTA 120

GARTCTCATT TCATGCAAGC CTTAGAAGAT TGTTTGAAAA GCCCTTTTGA TTTTGTGAGA 180

CTCTATGGGT GTTATTGGTA TTACCAACGA GACAAAATTC CATGTTTTGC CCAAAGAATT 240

TGTATTTCCT CCCTTTGA 258 

(2) INFORMATION FOR SEQ ID NO: 111:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 348 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...348

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 

TTGATCGCTT TGAGAGTAAC GGCTTGGAAR GTGGYGGCCA TGAAACGCTT GCATTTGAGC 60 

GTGAAAGACG CTGAAAACTT TGATGCGATC CTCAGAGAGA GACCCTTTTT TAAAGATTTG 120

ATAGAGTTTA TGGTGAGTGG TCCGGTGGTG GTTATGGTTT TAGAAGGCAA AGATGCGGTG 180

GCTAAAAACA GAGAGCTTAT GGGAGCGACT GATCCCAAAC TCGCCCAAAA AGGTACTATC 240

AGAGCGGATT TTGCTGAGAG CATTGACGCT AATGCGGTGC ATGGGAGCGA TAGCTTGGAA 300

AACGCGCACA ATGAAATCGC TTTCTTTTTT GCCGCTAGAG AGTTTTAA 348

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1185

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112 

TTGATGTGGC TCAAAACGCT TACACTTCAA ACGCTCAATA CCGACAAAGC CTTGCAAGAA 60 

TTTTCTAAAA CGATGGAGGC GTTTAAAACC AAACTCATCC AATCCGCTAA CGATGTGCAT 120

TCAGAGACTT CTCGCGCCGC TATCGCTAAC GATTTAGAAC GCTTAAAAGA GCATATGATA 180

AATGTCGCTA ATACCTCCAT AGGGGGGGAA TTTTTATTTG GAGGCAGTAA GGTGGATAGA 240

CCCCCCATTG ATAGTAATGG GAAATACCAT GGCAATGGCG AAGATTTAAA CGCGCTTATT 300

AGCTCTGATA ACCTTGTGCC TTATAATATC AGCGGGCAAG ATTTGTTTTT AGGCACCGAT 360

AAAGACAAAC ACAAACTCAT TACCACCAAC ATTAAATTAC TCAATCAAAA CAAGCTCCAM 420

CCTGATGTGA TGGACGCTTT AGAGCATTCT TCATTGCCTG AAGAAGTTTT CATTAAACCC 480

AGCGATACCT TGCGAGAACT CATCGGCGAT AACGATAAAA ACCCCACCAA TGACCCTAAA 540

GAGTTTTTTT ATTTGCAAGG CATTAGGCCT GATGGCTCTA GTTTTAAAGA AAAATTCGCG 600

TTGGATAAAG CCTATCAAAA CCAAGAAAGC GCGACTAAAG TGAGCGATTT GTTGGATAAA 660

ATCGGGCATG CTTACGGGAA CACTTCGCAA AATAAAGTCG TGGATGTGAG TTTGAACAAT 720

TGGGGGCAAA TTGAGATTAA AAACCTAACC CCCGGCAGTG AAAATTTGGA TTTTCATTTG 780

ATTTCTAGCG ATGGGGATTT TGACGATTTA GACGCCTTGC GTTCGAGCGG TAAAAGGGTT 840

ACTGAATATG TCAAAAGCGC GTTTGTAACG GATAGGAGTT TGAGCCAAGT TAAAGCGGTG 900

CCTAACATGT ATAACCCTAA AGTGCTTGAA ATCCCTAGCG TGTTTGTGAC TAAAGACAAT 960

GTTTTAGCTA ACAAAAACAC CAAGTTGAGC GAGATTTTTG GMGATAAGGT GGAAACTTTA 1020

AAAATCAACG CCAGCCGTTT GGGCGATGAA AGCGCTATTA AAATCCCAAA CCTCCCTATT 1080

AATTTGGATA TTCCCATTCT TTTAGATGTG AAAAACTCTA CGATTAAAGA TTTGAAAGAC 1140

GCGATTAAAG AACGCTTCAA TAATGAAGGT GGATGTGGAA ATTGA 1185

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...309

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113

TTGAAGGCAT TAAACGACTG CATGGTATTT TTTCATAAGA AAATTATTTT AAATTTTATC 60 

TATTCTTTAA TGGTTGCTTT TTTATTCCAT TTATCCTATG GGGTTCTTTT AAAAGCCGAT 120

GGAATGGCTA AAAAGCAAAC TCTTTTAGTG GGTGAAAGGC TTGTGTGGGA TAAGCTCACG 180

CTGTTAGGGT TTTTAGAAAA AAACCATATC CCCCAAAAAC TCTACTACAA TTTGAGCTCT 240

CAAGATAAAG AATTGAGTGC TGAAATCCAA AGCAATGTTA CCTACTACMA CTTTAAGAGA 300

TGCAAATAA 309

(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1092 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1092 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 

ATGAAATTTT TTCTTTTAAA GAAATTCAGC RAATTTTTAA ACACTCAAAC GCATTTTAAC 60 

CTCAAACGCT TGAACGCGTC TAGTTTTTTA TTAGAGACTT TTTCTAAAGA AAAACACGCC 120

TTTGTTGTGG ATTTGAGCGC GCCTTATATT GGTTTGTCCA AAAAACCCCC AGAGAGCGTT 180

TTAAAAAACA CTTTAGCGTT AGATTTTTGT TTGAATAAAT TCACCAAAAA CGCCAAAATT 240

TTACAAGCAA ACGTCATTGA TAACGATCGG ATTTTAGAAA TCAAGGGCGC TAAAGATTTA 300

GCTTATAAGA GTGAAACTTT TATTTTGCGT TTAGAAATGA TCCCTAAAAA AGCCAATCTC 360

ATGATTTTAG ATCAAGAAAA ATGCGTGATA GAGGCTTTTC GTTTTAATGA CAGGGTGGCT 420

AAAAACGATA TTTTAGGGGC ATTGCCTCCT AATATTTACG AGCATCAAGA AGAGGATTTG 480

GATTTTAAGG GATTGTTGGA CATTTTAGAA AAAGATTTTT TATCCTATCA GCACAAAGAA 540

TTGGAACACA AAAAAAATCA AATCATCAAG CGATTAAACG CCCAAAAAGA ACGCTTGAAA 600

GAAAAATTAG AAAAACTAGA AGATCCTAAA ACTTTACAGC TGGAAGCGAA AGAATTGCAA 660

ACTCAAGCCT CATTGTTGCT CACTTACCAG CATTTAATCA ACAGGCGTGA AAATCGCGTG 720

ATTTTAAAGG ATTTTGAAGA TAAAGAATGC ATGATTGAAA TTGATAAGAG CATGCCCTTA 780

AACGCTTTTA TCAATAAAAA ATTCACTCTC AGCAAGAAAA AGAAACAAAA ATCGCAATTC 840

TTGTATTTAG AAGAAGAGAA TCTGAAAGAA AAAATCGCTT TTAAGGAAAA TCAAATCAAC 900

TATGTTAGAG ACGCTGCAGA AGAAAGCGTT TTAGAAATGT TTATGCCGGT AAAAAACTCT 960

AAAATCAAAC GCCCGATGAA CGGGTATGAA GTGCTGTATT ATAAGGATTT WAAAWCGGGT 1020

TTRGGAAAAA CCAAAAAGAG AATATCAAGC TTTTACAAGA CGCAARARCG AATGATTTKT 1080

GGATGCAYKT GA 1092

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...172





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115

Met Lys Gly Pro Ile Leu Trp Pro Ala Phe Ser Gln Phe Ser Asp Gln
1               5                   10                  15

Asp Leu Ser Asp Ile Val Ala Tyr Leu Thr Ser Ile Leu Pro Lys Asn
            20                  25                  30

Leu Ser Asp Lys Glu Val Phe Ala Gln Ser Cys Gln Arg Cys His Ser
        35                  40                  45

Leu Asp Tyr Ala Lys Asp Lys Ala Phe Ser Asp Pro Lys Asp Leu Ala
    50                  55                  60

Asn Tyr Leu Gly Ser His Ala Pro Asp Leu Ser Met Met Ile Arg Ala
65                  70                  75                  80

Lys Gly Glu His Gly Leu Asn Val Phe Ile Asn Asp Pro Gln Lys Leu
                85                  90                  95

Leu Pro Gly Thr Ala Met Pro Arg Val Gly Leu Asn Glu Lys Ala Gln
            100                 105                 110

Lys Gln Val Ile Ser Tyr Leu Glu Lys Ala Gly Asp Arg Lys Lys His
        115                 120                 125

Glu Arg Asn Thr Leu Gly Ile Lys Ile Met Ile Phe Phe Ala Val Leu
    130                 135                 140

Ser Phe Leu Ala Tyr Ala Gly Lys Glu Lys Phe Gly Ala Lys Cys Ile
145                 150                 155                 160

Lys Phe Lys Lys Gly Gly Thr Trp Phe Tyr Asp Phe
                165                 170

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...61

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116

Met Gln Glu Phe Ser Leu Trp Cys Asp Phe Ile Glu Arg Asp Phe Leu
1               5                   10                  15

Glu Asn Asp Phe Leu Lys Leu Ile Asn Lys Gly Ala Ile Cys Gly Xaa
            20                  25                  30

Thr Ser Asn Pro Ser Leu Phe Cys Glu Ala Ile Thr Lys Ser Ala Phe
        35                  40                  45

Tyr Gln Asp Glu Ile Ala Lys Xaa Gln Arg Gln Lys Ser
    50                  55                  60

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...286

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117 



Leu Xaa Pro Met Lys Val Ile Gln Val Phe Leu Phe Ser Asn Pro Phe
1               5                   10                  15

Cys Ala Ile Val Pro Asn Thr Glu Pro Glu His Leu Glu His Tyr Asp
            20                  25                  30

His Asp Leu Glu Arg Phe Phe Phe Ala Tyr Lys Tyr Phe Leu Asp His
        35                  40                  45

Ala Gln Lys Arg Val Ile Tyr Lys Glu Asp Pro Phe Leu Lys Asn Tyr
    50                  55                  60

Ser Lys Asp Ala Ile Val Leu Glu Lys Lys Asp Ile Tyr Asn Ile Gln
65                  70                  75                  80

Tyr Ile Leu Lys Asp Gly Glu Pro Tyr Thr Ser Phe Glu Leu Lys Asn
                85                  90                  95

Leu Gly Ala Phe Leu Val Trp Gly Leu Gly Glu His Asn Ala Thr Asn
            100                 105                 110

Ala Ser Leu Ala Ile Leu Ser Ala Leu Asp Glu Leu Asn Leu Glu Glu
        115                 120                 125

Ile Arg Asn Asn Xaa Leu Asn Phe Lys Gly Ile Lys Lys Arg Phe Asp
    130                 135                 140

Ile Leu Gln Lys Asn Asn Leu Ile Leu Ile Asp Asp Ti- Ala His His
145                 150                 155                 160

Pro Thr Glu Ile Gly Xaa Thr Leu Lys Ser Ala Arg Ile Tyr Ala Asn
                165                 170                 175

Leu Leu Asn Thr Gln Glu Lys Ile Ile Val Ile Trp Gln Ala His Lys
            180                 185                 190

Tyr Ser Arg Leu Met Asp Asn Leu Glu Glu Phe Lys i»y £ Cys Phe Leu
        195                 200                 205

Glu His Cys Asp Arg Leu Ile Ile Leu Pro Val Tyr Ser Ala Ser Glu
    210                 215                 220

Val Lys Arg Asp Ile Asp Leu Lys Ala His Phe Lys His Tyr Asn Pro
225                 230                 235                 240

Thr Phe Ile Asp Arg Val Arg Lys Lys Gly Asp Phe Leu Glu Leu Leu
                245                 250                 255

Val Asn Asp Asn Val Val Glu Thr Ile Glu Lys Gly Phe Val Ile Gly
            260                 265                 270

Phe Gly Ala Gly Asp Ile Thr Tyr Gln Leu Arg Gly Glu Met
        275                 280                 285

(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...61

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 

Leu Leu Leu Phe Phe Leu Leu Lys Gly Val Val Phe Ser Leu Gly Phe
1               5                   10                  15

Phe Ser Phe Phe Glu Glu Val Ser Gly Ser Phe Xaa Ala Val Ser Leu
            20                  25                  30

Xaa Val Leu Ala Leu Val Met Gly Ser Ser Xaa Gly Leu Glu Glu Phe
        35                  40                  45

Cys Val Leu Glu Glu Leu Ile Asn Ser Gly Leu Ser Val
    50                  55                  60

(2) INFORMATION FOR SEQ ID NO: 119:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...122





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119

Met Gly Phe Leu Lys Val Leu Lys His Asp Ala Leu Gly Gln Val Gly
1               5                   10                  15

Asn Ile Val Ile Gly Asn Phe Leu Ile Thr Leu Thr Val Leu Ala Val
            20                  25                  30

Cys Phe Ser Ser Gln Ser Ala Glu Glu Thr Thr Met Leu Thr Leu Ser
        35                  40                  45

Tyr Thr Leu Phe Phe Ile Leu Gly Ala Phe Leu Leu Val Ala Ile Ser
    50                  55                  60

Gly Val Val Gly Ala Ile Lys Asn Leu Asn Ala Leu Phe Ser Lys Arg
65                  70                  75                  80

Leu Ser Phe Ser Leu Pro Ile Ser Leu Glu Ser Leu Leu Leu Pro Lys
                85                  90                  95

Ile Leu Leu Pro Xaa Val Phe Phe Tyr Leu Gln Phe Val Leu Val Cys
            100                 105                 110

Gly Glu Arg Ala Phe Gly Leu Leu Pro Phe
        115                 120

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 187 amino acids 




(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...187

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120



Met Leu Lys Thr His Leu Ser Ser Ala Arg Gly Val Val Val Leu Ser
1               5                   10                  15

Lys Ile Leu Pro Val Asn Val Val Leu Met Val Ser Val Arg Leu Phe
            20                  25                  30

Glu Lys Glu Leu Lys Arg Lys Pro Tyr Tyr Ile Ile Ala Ser Ala His
        35                  40                  45

Val Ser Asn Glu Gly Leu Glu Lys Leu Lys Lys Xaa Gly Xaa Asp Met
    50                  55                  60

Xaa Xaa Pro Thr Lys Leu Met Ala Gln Arg Val Ser Ala Asn Xaa Trp
65                  70                  75                  80

Cys Xaa Leu Asp Met Glu Asn Ile Leu Glu Arg Phe Ile Asn Lys Lys
                85                  90                  95

Asp Thr Leu Leu Asp Leu Glu Glu Val Ile Val Pro Lys Thr Ser Trp
            100                 105                 110

Leu Val Leu Arg Lys Leu Lys Glu Ala His Phe Arg Glu Ile Ala Lys
        115                 120                 125

Ala Phe Val Ile Gly Ile Thr Gln Lys Asp Gly Lys Tyr Ile Pro Met
    130                 135                 140

Val Pro Asp Gly Glu Thr Ile Ile Ala Ser Glu Ser Lys Leu Leu Met
145                 150                 155                 160

Gly Thr Ser Glu Gly Val Ala Thr Cys Lys Gln Leu Ile Thr Ser His
                165                 170                 175

Gln Lys Pro Lys Glu Val Asp Tyr Ile Ser Leu
            180                 185

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...193

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121 

Val Gly Ser Phe Leu Phe Val Gly Pro Ser Gly Val Gly Lys Thr Glu
1               5                   10                  15

Leu Ala Lys Glu Leu Ala Leu Asn Leu Xaa Leu His Phe Glu Arg Phe
            20                  25                  30

Asp Met Ser Glu Tyr Lys Glu Ala His Ser Val Ala Lys Leu Ile Gly
        35                  40                  45

Ser Pro Ser Gly Tyr Val Gly Phe Glu Gln Gly Gly Leu Leu Val Asn
    50                  55                  60

Ala Ile Lys Lys His Pro His Cys Leu Leu Leu Leu Asp Glu Ile Glu
65                  70                  75                  80

Lys Ala His Pro Asn Val Tyr Asp Leu Leu Leu Gln Val Met Xaa Asn
                85                  90                  95

Ala Thr Leu Ser Asp Asn Leu Gly Asn Lys Ala Ser Phe Lys His Val
            100                 105                 110

Ile Leu Ile Met Thr Xaa Xaa Val Gly Ser Lys Asp Lys Asp Thr Leu
        115                 120                 125

Gly Phe Phe Ser Thr Lys Asn Ala Lys Tyr Asp Ara Ala Val Lys Glu
    130                 135                 140

Leu Leu Thr Pro Glu Leu Arg Ser Arg Ile Asp Ala Ile Val Pro Phe
145                 150                 155                 160

Asn Ala Leu Ser Leu Glu Asp Phe Glu Thr His Cys Phe Cys Gly Ile
                165                 170                 175

Gly Arg Val Lys Ser Pro Ser Thr Arg Ala Arg Arg Asp Leu Lys Ile
            180                 185                 190

Pro

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...303



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122 



Met Ala Phe Gln Val Asn Thr Asn Ile Asn Ala Met Asn Ala His Val
1 5 10 15

Gln Ser Ala Leu Thr Gln Asn Ala Leu Lys Thr Ser Leu Glu Arg Leu
20 25 30

Ser Ser Gly Leu Arg Ile Asn Lys Ala Ala Asp Asp Ala Ser Gly Met
35 40 45

Thr Val Ala Asp Ser Leu Arg Ser Gln Ala Ser Ser Leu Gly Gln Ala
50 55 60

Ile Ala Asn Thr Asn Asp Gly Met Gly Ile Ile Gln Val Ala Asp Lys
65 70 75 80

Ala Met Asp Glu Gln Leu Lys Ile Leu Asp Thr Val Lys Val Lys Ala
85 90 95

Thr Gln Ala Ala Gln Asp Gly Gln Thr Thr Glu Ser Arg Lys Ala Ile
100 105 110

Gln Ser Asp Ile Val Arg Leu Ile Gln Gly Leu Asp Asn Ile Gly Asn
115 120 125

Thr Thr Thr Tyr Asn Gly Gln Ala Leu Leu Ser Gly Gln Phe Thr Asn
130 135 140

Lys Glu Phe Gln Val Gly Ala Tyr Ser Asn Gln Ser Ile Lys Ala Ser
145 150 155 160

Ile Gly Ser Thr Thr Ser Asp Lys Ile Gly Gln Val Arg Ile Ala Thr
165 170 175

Gly Ala Leu Ile Thr Ala Ser Gly Asp Ile Ser Leu Thr Phe Lys Gln
180 185 190

Val Asp Gly Val Asn Asp Val Thr Leu Glu Ser Val Lys Val Ser Ser
195 200 205

Ser Ala Gly Thr Gly Ile Gly Val Leu Ala Glu Val Ile Asn Lys Asn
210 215 220

Ser Asn Arg Thr Gly Val Lys Ala Tyr Ala Ser Val Ile Thr Thr Ser
225 230 235 240
Asp


Val 


Ala 


Val 


Gin 


Ser 


Gly 


Ser 


Leu 


Ser 


Asn 


Leu 








245 










250 






He 


His 


Leu 


Glv 


Asn 


He 


Ala 


Asp 


He 


Lys 


Xaa 


Asn 






2 60 










265 






Glu 


Arg 


Leu 


Val 


Thr 


Ala 


He 


Asn 


Ala 


Val 


Thr 


Ser 




275 










280 










Ala 


Tyr 
290 


Thr 


Asp 


Gin 


Lys 


Gly 
295 


Arg 


Leu 


Asn 


Leu 


Arg 
300 




;n Leu Thr Leu Asn 
255 

Asp Ser Asp 

270 
Thr Gly Val 
285 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 161 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...161

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123












Met Phe Phe Lys Thr Tyr Gln Lys Leu Leu Gly Ala Ser Cys Leu Ala
1 5 10 15

Val Leu Tyr Leu Val Gly Cys Gly Asn Gly Gly Gly Gly Glu Ser Pro
20 25 30

Ala Glu Met Ile Xaa Asn Ser Glu Gly Thr Phe Gln Ile Asp Ser Lys
35 40 45

Ala Asp Ser Ile Thr Ile Gln Gly Val Lys Leu Asn Arg Gly Asn Cys
50 55 60

Val Ser Val Asn Phe Val Pro Val Ser Glu Thr Phe Gln Met Gly Leu
65 70 75 80

Gln Val Thr Pro Ile Ser Ile Gln Asp Phe Lys Asp Met Ala Ser Thr
85 90 95

Tyr Lys Ile Phe Asp Gln Lys Lys Gly Leu Ala Asn Ile Ala Asn Lys
100 105 110

Thr Leu Ile Ser Gln Leu Glu Gln Lys Gly Val Met Met Lys Pro Xaa
115 120 125

Ile Asn Phe Gly Glu Ser Leu Lys Gly Ile Ser Gln Gly Cys Asn Ile
130 135 140

Glu Ala Glu Ile Gln Thr Asp Lys Gly Ala Trp Thr Phe Asn Phe Asp
145 150 155 160

Lys

































(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...91

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124



Val Cys Leu Gly Leu Ala Asp Val Met Val Val Leu Ser Leu His Leu
1 5 10 15

Asn Leu Asn Pro Thr Asn Pro Lys Trp Leu Asn Arg Asp Arg Leu Val
20 25 30

Phe Ser Gly Gly His Ala Ser Ala Leu Val Tyr Ser Leu Leu His Leu
35 40 45

Trp Gly Phe Asp Leu Ser Leu Asp Asp Leu Lys Arg Phe Arg Gln Leu
50 55 60

His Ser Lys Thr Pro Gly His Pro Glu Leu His His Thr Glu Gly Ile
65 70 75 80

Glu Ile Thr Thr Xaa Phe Arg Ala Arg Phe Cys
85 90



(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 187 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...187

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125



Met Thr Thr Pro Met Ile Ile Ile Ser Leu Glu Met Gly Leu Ser Leu
1 5 10 15

Val Pro Met Arg Gln Cys Leu Val Cys Gln Ala Leu Ala Arg Ser Ile
20 25 30

Ser Trp Asn Gly Leu Gly Gly Asn Val Arg Asn Thr Lys Val Tyr Gly
35 40 45

Lys Phe Ala Ala Tyr His His Leu Gln Lys Tyr Leu Leu Ile Asp Leu
50 55 60

Ile Ala Arg Phe Lys Thr Gln Gly Gly Tyr Ile Phe Arg Tyr Asn Thr
65 70 75 80

Asp Asp Tyr Leu Pro Leu Asn Ser Thr Phe Tyr Met Gly Gly Val Thr
85 90 95

Thr Val Arg Gly Phe Arg Asn Gly Ser Ile Thr Pro Lys Asp Glu Phe
100 105 110

Gly Leu Trp Leu Gly Gly Asp Gly Ile Phe Thr Xaa Ser Thr Glu Leu
115 120 125

Ser Tyr Gly Val Leu Lys Ala Ala Lys Met Arg Leu Ala Trp Phe Phe
130 135 140

Asp Phe Gly Phe Leu Thr Phe Xaa Thr Pro Thr Arg Gly Ser Phe Phe
145 150 155 160

Tyr Asn Ala Xaa Thr Thr Thr Ala Asn Phe Lys Asp Tyr Xaa Val Val
165 170 175

Gly Xaa Xaa Phe Glu Xaa Ala Thr Trp Arg Ala
180 185
















(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



SUBSTITUTE SHEET (RULE 26) 



WO 97/19098 



PCT/US96/18542 




(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...104

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126










Met Gln Ala Leu Lys Ser Leu Leu Glu Val Ile Thr Lys Leu Gln Asn
1 5 10 15

Leu Gly Gly Tyr Leu Met His Ile Ala Ile Phe Ile Ile Phe Ile Trp
20 25 30

Ile Gly Gly Leu Lys Phe Val Pro Tyr Glu Ala Glu Gly Ile Ala Pro
35 40 45

Phe Val Xaa Asn Ser Pro Phe Phe Ser Phe Met Tyr Lys Phe Glu Lys
50 55 60

Pro Ala Tyr Lys Gln His Lys Met Ser Glu Ser Gln Ser Met Gln Glu
65 70 75 80

Glu Met Gln Asp Asn Pro Lys Ile Val Glu Asn Lys Xaa Trp His Lys
85 90 95

Glu Asn Arg Thr Ser Phe Ser Gly
100


















(2) INFORMATION FOR SEQ ID NO: 127:

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...182



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127 



Met Gln Phe Glu Glu Met Lys Glu Leu Ala His Gln Ile Gly Val Phe
1 5 10 15

Tyr His Val Gly Val Asp Gly Ile Ala Leu Phe Leu Leu Leu Leu Asn
20 25 30

Ala Ile Val Val Leu Leu Ser Val Val Tyr Val Lys Glu Arg Arg Lys
35 40 45

Asp Phe Val Ile Cys Leu Leu Leu Leu Xaa Gly Ile Leu Met Gly Val
50 55 60

Phe Ser Ser Leu Asn Val Ile Phe Phe Tyr Ala Phe Trp Glu Ile Ser
65 70 75 80

Leu Leu Pro Val Leu Tyr Leu Ile Gly Arg Phe Gly Arg Asn Asn Lys
85 90 95

Ile Tyr Ser Gly Met Lys Phe Phe Leu Tyr Thr Phe Leu Ala Ser Leu
100 105 110

Cys Met Leu Leu Gly Ile Leu Tyr Ile Gly Tyr Asp Tyr Ala Asn Asn
115 120 125

Tyr Gly Met Met Ser Phe Asp Ile Leu Asp Trp Tyr Gln Leu Asn Phe
130 135 140

Ser Ser Gly Ile Lys Thr Trp Leu Phe Val Ala Phe Leu Ile Gly Ile
145 150 155 160

Ala Val Lys Ile Pro Leu Phe Pro Phe Thr His Gly Cys Leu Met Arg
165 170 175

Ile Leu Thr Pro Pro Leu
180

(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...116

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128



Val Lys Lys Tyr Ala Glu Asp Phe Ile Thr Lys Asp Glu Val Lys Ser
1 5 10 15

Leu Leu Glu Arg Leu Ala Lys Asp Tyr Pro Thr Ile Val Glu Glu Ser
20 25 30

Lys Lys Ile Pro Thr Gly Ala Ile Arg Ser Val Leu Gln Ala Leu Leu
35 40 45

His Glu Lys Ile Pro Ile Lys Asp Met Leu Thr Ile Leu Glu Thr Ile
50 55 60

Thr Asp Ile Ala Pro Leu Val Gln Asn Asp Val Asn Ile Leu Thr Glu
65 70 75 80

Gln Val Arg Ala Arg Leu Ser Arg Val Ile Thr Asn Ala Phe Lys Ser
85 90 95

Glu Asp Gly Arg Leu Lys Phe Leu Thr Phe Ser Thr Asp Xaa Glu Gln
100 105 110

Phe Xaa Ala Gln
115



(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...240



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129 



Met Met Lys Asn Lys Arg Ser Gln Asn Ser Pro Tyr Val Thr Pro Asp
1 5 10 15

Asn Pro Tyr Leu Thr Leu Glu Lys Ala Leu Gly Tyr Ser Phe Lys Asp
20 25 30

Lys Arg Leu Leu Glu Gln Ala Leu Thr His Lys Ser Cys Lys Leu Ala
35 40 45

Leu Asn Asn Glu Arg Leu Glu Phe Leu Gly Asp Ala Val Leu Gly Leu
50 55 60

Val Ile Gly Glu Leu Leu Tyr His Lys Phe Xaa Xaa Xaa Asp Gly Gly
65 70 75 80



Lys 


Leu 




Lys 


Leu 


Arg 


nld 




He 


Va 1 


Ser 


Ala 


His 


Gly 


Phe 


Thr 








ft ^ 










9 0 










95 




Lys 


Leu 


Mia 


Lys 


nld 


x x e 


A I 3 

Ala 


- 

Leu 


Ulil 


Asp 


TVr 
i y x 


Leu 


Ar g 


Val 


Ser 


Ser 






inn 

X u u 










105 










110 






ber 


r 1 1 1 

UlU 


uiU 


Tie 
lie 




Xiy s 


("I \r 

uly 


Ar 3 




uy 


Pro 


Ser 


He 


Leu 


Ser 


Ser 






1 1 P 










12 0 










12 5 








Ala 


rne 


bill 


H.1 a 


Leu 


lie 


ax a 




vox 


i yx 


T 


Glu 


Ala 


Glv 


Leu 


Ala 




130 










IOC 

13 d 










i yi n 

1 4i U 










Lys 




Arg 


Lys 


Tip 
X X tr 


X X fct 




Asn 




Leu 


Asn 


Ar g 


Ala 


Tvr 


Lys 


Ar Q" 


14b 










X P u 










155 










16 0 


Leu 


Asp 


Lieu 


\p X VJ 


ni j 


Toil 




Met 


Asp 


Tvr 

j y x 


Lys 


Thr 


Ala 


Leu 


Gin 


Glu 








165 










170 










175 




Leu Thr Gln Xaa Gln Phe Cys Val Ile Pro Thr Tyr Gln Leu Leu Gln
180 185 190

Glu Lys Gly Pro Asp His His Lys Glu Phe Glu Met Ala Leu Tyr Ile
195 200 205








Gin 


As* -4 


Lys 


Me t 


- j — 


Al a 




Al ?. 


Lys 


Gly 


T.wo 


?<? r " 


T.y = 


T.ye; 


GIm 






210 










215 










220 










Glu Gln Gln Cys Ala Tyr Gln Ala Leu Gln Asn Leu Arg Lys Pro Asn
225 230 235 240



(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 228 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...228



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130 



Leu Leu Val Leu Leu Asn Leu Lys Xaa Thr Pro Asn Leu Met Trp Pro
1 5 10 15

Leu Asp Ile Ile Val Val Val Ala Trp Val Leu Trp Gly Val Asn Met
20 25 30

Phe Gly Ser Met Ser Val Arg Arg Glu Asn Thr Ile Tyr Val Ser Leu
35 40 45

Trp Tyr Tyr Ile Ala Thr Tyr Val Gly Ile Ala Val Met Tyr Ile Phe
50 55 60

Asn Asn Leu Ser Ile Pro Thr Tyr Phe Val Ala Asp Met Gly Ser Val
65 70 75 80

Trp His Xaa Ile Ser Met Tyr Ser Gly Ser Asn Asp Ala Leu Ile Gln
85 90 95

Trp Trp Trp Gly His Asn Ala Val Ala Phe Val Phe Thr Ser Gly Val
100 105 110

Ile Gly Thr Ile Tyr Tyr Phe Leu Pro Lys Glu Ser Gly Gln Pro Ile
115 120 125

Phe Ser Tyr Lys Leu Thr Leu Phe Ser Phe Trp Ser Leu Met Phe Val
130 135 140

Tyr Ile Trp Ala Gly Gly His His Leu Ile Tyr Ser Thr Val Xaa Asp
145 150 155 160

Xaa Val Gln Thr Leu Ser Ser Xaa Phe Ser Val Val Leu Ile Leu Pro
165 170 175

Ser Xaa Gly Thr Ala Ile Asn Met Leu Leu Xaa Met Arg Gly Gln Trp
180 185 190

His Gln Xaa Lys Glu Ser Pro Leu Ile Lys Phe Leu Val Leu Ala Ser
195 200 205

Thr Phe Tyr Met Leu Ser Thr Leu Glu Gly Ser Ile Gln Ala Ile Lys
210 215 220

Ser Val Asn Ala
225

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...162



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 



Met Lys Ala Pro Ser Gln Xaa Asp Leu Lys Lys Ile Leu Gly Ile Glu
1 5 10 15

Glu Val Ile Xaa Xaa Ser Thr Ser Pro Met Glu Leu Arg Leu Ala Asn
20 25 30

Gln Lys Leu Gly Asn Arg Phe Ile Lys Thr Leu Gln Ala Met Asn Glu
35 40 45

Leu Asp Met Gly Ala Phe Phe Asn Ala Tyr Ala Gln Thr Thr Lys Asp
50 55 60

Pro Thr His Ala Thr Ser Tyr Gly Val Phe Ala Ala Ser Leu Asn Met
65 70 75 80

Glu Leu Lys Lys Ala Leu Arg His Tyr Leu Tyr Ala Gln Thr Ser Asn
85 90 95

Met Val Ile Asn Cys Val Lys Ser Val Pro Leu Ser Gln Asn Asp Gly
100 105 110

Gln Lys Ile Leu Leu Ser Leu Gln Ser Pro Phe Asn Gln Leu Ile Glu
115 120 125

Gln Lys Thr Leu Glu Leu Asp Glu Ser His Leu Cys Ala Ala Ser Val
130 135 140

Asn Asp Ile Lys Ala Met Gln His Glu Ser Leu Tyr Ser Arg Leu Tyr
145 150 155 160

Met Ser



(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...59

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132

Met Ala Phe Ile Leu Thr Thr Asn Leu Phe Ile Lys Ser Phe Thr Asn
1 5 10 15

Ser Ile Arg Ile Thr Gly Cys Ile Ile Ser Pro Asn Val Phe Phe Ala
20 25 30

Tyr Glu Phe Cys Ala Leu Gly Phe Arg Lys Gly Gly Leu Ile Leu Asp
35 40 45

Asn Phe Ser Lys Phe Val Ser His Arg Leu Gln
50 55



(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 248 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...248

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133












Val Arg Phe Phe Ile Phe Leu Ile Leu Ile Cys Pro Leu Ile Cys Pro
1 5 10 15

Leu Met Ser Ala Asp Ser Ala Leu Pro Ser Val Asn Leu Ser Leu Asn
20 25 30

Ile Ala Ala Pro Ser Asp Pro Lys Gln Leu Val Thr Thr Leu Asn Val
35 40 45

Thr Leu Leu Thr Leu Leu Val Leu Ala Pro Ser Leu Ile Leu Val Met
50 55 60

Ala Ser Phe Thr Arg Leu Ile Val Val Phe Ser Phe Leu Arg Thr Leu
65 70 75 80

Gly Thr Gln Gln Thr Pro Pro Thr Gln Ile Leu Val Ser Leu Ser Leu
85 90 95

Ile Leu Thr Phe Phe Ile Met Glu Pro Ser Leu Lys Lys Ala Tyr Asp
100 105 110

Glu Ala Thr Gly Ile Lys Pro Tyr Met Asp Lys Lys Ile Ser Tyr Thr
115 120 125

Thr Phe Glu Lys Ser Thr Leu Pro Phe Lys Glu Phe Met Leu Lys Asn
130 135 140








Asn 


Arg 


Glu 


Lys 


Asp 


Leu 


Ala 


Leu 


Phe 


Phe 


Arg 


lie 


Arg 


Asn 


Leu 


Pro 


145 






150 










155 








Ala 


160 


Pro 


Lys 


Thr 


Pro 


Asp 


Asp 


Val 


Ser 


Leu 


Ser 


Val 


Leu 


lie 


Pro 


Phe 








165 










170 










175 




Met Ile Ser Glu Leu Lys Thr Ala Phe Gln Ile Gly Phe Leu Leu Tyr
180 185 190

Ala Leu Pro Phe Leu Val Ile Asp Met Val Ile Ser Ser Ile Leu Met
195 200 205

Met Gly Met Met Met Leu Pro Pro Val Met Ile Ser Leu Pro Phe Lys
210 215 220

Glu Ile Leu Val Phe Ile Leu Val Asp Gly Phe Asn Leu Leu Thr Asn
225 230 235 240

Leu Val Ala Ser Phe Lys Met Val
245

























(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 166 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...166

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134



Leu Leu Val Thr Phe Leu Asn Gly Phe Asp Pro Lys Ile Ala Asn Leu
1 5 10 15

Arg Lys Ala Cys Asn Val Tyr Ser Val Gly Val Ile Tyr Ile Val Thr
20 25 30

Thr Asn Thr Leu Asn Ile Leu Ser Cys Glu Ser Phe Glu Ile Leu Glu
35 40 45








T.y* 


Arn 


m u 


Leu 


A?=;p 


Thr 


Ser 


Gly 


Val 


Thr 


Lys 


Thr 


Ser 


Thr 


Pro 


Phe 




50~ 










55 










60 










Phe Ser Arg Val Glu Gly Ile Asp Ala Gly Thr Leu Gly Lys Leu Phe
65 70 75 80

Ser Gly Ser Gln Ser Lys Asn Tyr Phe Ala Tyr Tyr Asp Ala Leu Val
85 90 95

Lys Lys Glu Lys Arg Lys Glu Val Arg Ile Glu Lys Lys Glu Glu Arg
100 105 110

Ile Asp Ala Arg Glu Asn Lys Arg Glu Ile Lys Gln Glu Ala Ile Lys
115 120 125

Glu Pro Lys Lys Ala Asn Gln Gly Thr Glu Asn Ala Pro Thr Leu Glu
130 135 140

Glu Lys Xaa Tyr Gln Lys Ala Glu Arg Lys Phe Asp Ala Lys Xaa Xaa
145 150 155 160

Arg Arg Ser Phe Lys Xaa
165
























(2) INFORMATION FOR SEQ ID NO: 135:

















(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 127 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...127

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135



Met Glu Asn Ser Thr Leu Tyr Ile Val Ile Ala Gly Leu Trp Leu Ala
1 5 10 15

Val Gly Phe Gly Ile Phe Leu Lys Lys Leu Asp Met Pro Val Ile Ile
20 25 30

Gly Tyr Ile Cys Thr Gly Thr Val Leu Ala Ala Phe Phe Lys Ile Asn
35 40 45

Asp Phe Asn Leu Leu Ser Asp Ile Gly Glu Phe Gly Ile Val Phe Leu
50 55 60

Met Phe Met Ile Gly Ile Glu Phe Asn Phe Asp Lys Leu Lys Ser Ile
65 70 75 80

Lys Gln Glu Val Leu Val Phe Gly Leu Leu Gln Val Val Leu Cys Ala
85 90 95

Leu Ile Ala Phe Leu Leu Gly Tyr Phe Val Leu Gly Leu Ser Pro Ile
100 105 110

Phe Ser Leu Val Leu Gly Met Gly Leu Ser Leu Ser Ser Thr Ala
115 120 125

(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(B) LOCATION 1...16

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136

Leu Leu Leu Met Leu Asn Lys Pro Lys Pro Leu Phe Leu Xaa Leu Gly
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...350

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137

Met Ala Leu Arg Val Leu Leu Phe Phe Cys Phe Leu Phe Leu Gln Ala
1 5 10 15

Glu Asp Lys Ser Gln Glu Leu Ser Ser Ile Gln Lys Gln Met Ala Leu
20 25 30

Val Asp Lys Lys Leu Ala Lys Asp Asp Asn Val Trp Leu Lys Lys Phe
35 40 45

Glu Asn Tyr Lys Ile Tyr Asn Gln Ile Tyr Thr Glu Lys Glu Ser Val
50 55 60

Arg Gln Glu Leu Arg Arg Leu Lys Asn Lys Lys Ser Lys Asp Leu Leu
65 70 75 80

Lys Ile Ser Thr Leu Glu His Thr Leu Lys Ala Leu Glu Ser Gln Gln
85 90 95

Lys Met Phe Glu Ser Tyr Gly Val Asn Pro Phe Lys Asp Leu Ile Glu
100 105 110

Arg Pro Asn Ile Pro Asn Ile Pro Asn Ile Ala Asn Pro Ile Ala Ile
115 120 125

Ile Asp Gly Ile Ser Phe Ile Lys Ser Met Arg Leu Lys His Glu Asn
130 135 140

Leu Lys Asn Asn Gln Thr Ser Leu Gly Glu Val Leu Lys Leu Leu Asp
145 150 155 160

Gln Lys His Gln Leu Leu Asn Gln Trp His Ala Leu Asp Lys Ser Ala
165 170 175

Lys Leu Ser Asp Glu Ile Tyr Gln Thr Gln Ala Lys Arg Leu Glu Leu
180 185 190
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Gin 


Gly 


Ala 


Gin 


Asn 


He 


Leu 


Lys 


Thr 


Asp 




195 










200 




Ser 


Asp 


Glu 


Ala 


He 


Ser 


lie 


Val 




210 










215 


Leu 


Phe 


Lys 


Leu 


Val 


Tyr 


Val 


Phe 


Leu 


22 5 










230 








Pne 


Ala 


Trp 


lie 


Leu 


Lys 


He 


He 


Ser 


GlU 








245 










Arg 


Val 


Tyr 


Thr 


Val 


Asn 


Lys 


Ala 


Val 






260 








265 


He 


Xaa 


Xaa 


He 


Xaa 


Leu 


Phe 


Ser 


Leu 




27 5 










280 


Val 


Thr 


Val 


Leu 


Gly 


Phe 


Ala 


Ser 




290 










295 




Lys 


Asp 


Leu 


Phe 


Met 


Ser 


Leu 


Leu 


Gly 


305 










310 






Gly 


Ser 


Val 


His 


Val 


Gly 


Asp 


Arg 


Val 


















He 


Phe 


lie 


Gly 


Asp 


Val 


Leu 


Asp 


Thr 








340 








345 
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Thr He Gly He Phe Gin Lys 
205 

Lys Ser Gin Val Lys Asn Gin 
220 

Ala Ala Leu Leu Ser Val Val 
235 240 
Ser Lys Tyr He Glu Asn Asn 
250 255 
lie Asn Phe Val Asn Val Ser 
270 

Tyr Leu Glu Asn Val Thr Tyr 
285 

Ala Gly Leu Ala He Xaa Met 
300 

Trp Phe He He Leu He Gly 
315 320 
Arg lie Ala Lys Gly Thr Asp 

"> -> o - - 

Ser Asn Val Val His 
350 

(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 99 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 99

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138



30 

Asn Leu Asp Ala Ala Lys Phe
45

Leu Ser Glu Lys Ile Ser Arg
60

Pro Arg Thr Leu Lys Lys Arg
75 80

Cys Lys Tyr Pro Thr Thr Pro
90 95



Met 
1 


Lys 


Glu 


Gin 


Glu 
5 


Trp 


Asp 


Leu 


Ser 


Ser 


Ala 


Glu 


Glu 


Phe 


Leu 


Lys 


Thr 


Leu 


Glu 






20 








25 


Asn 


Ala 


Tyr 


Gin 


Asn 


Asn 


Leu 


Lys 


Ala 




35 










40 


Asn 


Thr 


Leu 


Lys 


His 


Tvr 


Glu 


Asn 




50 










55 




Ala 


Met 


Ala 


Tyr 


Ala 


Asn 


Tyr 


Phe 


Leu 


65 










70 






Ser 


Phe 


He 


Arg 


Asn 


Ala 


Asn 


Gly 


Leu 










85 








Phe 


He 


Leu 














(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 78 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139 

Leu Arg Val Gly Met Tyr Glu Val Cys Asn His Lys Asp Gly Thr Ala
1 5 10 15

Tyr His Ser Thr Arg Gly Ser Lys Val Thr Leu Ala Cys Lys Thr Gly
20 25 30

Thr Ala Gln Val Val Glu Ile Ala Gln Asn Ile Val Asn Arg Met Lys
35 40 45

Glu Lys Asp Met Glu Tyr Phe His Xaa Ser His Xaa Trp Ile Thr Xaa
50 55 60

Tyr Leu Xaa Pro Met Lys Asn Pro Asn Thr Leu Ser Leu Phe
65 70 75

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 52

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140

Leu Gly Leu Val Xaa Gly Ile Ser Leu Leu His Leu Ser Leu Glu Gln
1 5 10 15

Lys Ile Ser Val Phe Leu Gly Xaa Asn Leu Met Leu Tyr Pro Val Xaa
20 25 30

Glu Val Leu Phe Ser Ile Leu Arg Arg Lys Ile Lys Arg Gln Lys Ala
35 40 45

Thr His Ala Gly
50

(2) INFORMATION FOR SEQ ID NO: 141:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 377 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 377

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141

Leu Ala Gln Pro Val Gln Val Arg Thr Val Phe Met Ser Met Thr Leu
1 5 10 15

Asn Ala Met Gly Gln Phe Ala Tyr Asn Phe Pro Ala Asn Ile Ser Lys
20 25 30

Asp Lys Gln Lys Leu Thr Met Val Tyr Met Asp Lys Asp Tyr Gly Ala
35 40 45

Tyr Gly Asn Ile Val Ala Met Gly Gly Glu Tyr Val Lys Ile Glu Leu
50 55 60

Gly Thr Asp Thr Gly Leu Asn Pro Phe Ala Trp Ala Ala Cys Val Gln
65 70 75 80

Lys Thr Asn Ala Thr Met Glu Gln Lys Gln Thr Ala Ile Ser Val Val
85 90 95

Lys Glu Leu Val Lys Asn Leu Ala Thr Lys Ser Asp Glu Lys Asp Glu
100 105 110

Asn Gly Asn Ser Ile Ser Phe Ser Leu Ala Asp Ser Asn Thr Leu Ala
115 120 125

Ala Ala Val Thr Asn Leu Ile Thr Gly Asp Met Asn Leu Asp Tyr Pro
130 135 140

Ile Thr Gln Leu Ile Asn Ala Phe Gly Lys Asp His Asn Asp Pro Asn
145 150 155 160

Gly Leu Val Ala Arg Leu Ala Pro Phe Cys Lys Ser Thr Asn Gly Glu
165 170 175

Phe Gln Trp Leu Phe Asp Asn Lys Ala Thr Asp Arg Leu Asp Phe Ser
180 185 190

Lys Thr Ile Ile Gly Val Asp Gly Ser Ser Phe Leu Asp Asn Asn Asp
195 200 205

Val Ser Pro Phe Ile Cys Phe Tyr Leu Phe Ala Arg Ile Gln Glu Ala
210 215 220

Met Asp Gly Arg Arg Phe Val Leu Asp Ile Asp Glu Ala Trp Lys Tyr
225 230 235 240

Leu Gly Asp Pro Lys Val Ala Tyr Phe Val Arg Asp Met Leu Lys Thr
245 250 255

Ala Arg Lys Arg Asn Ala Ile Val Arg Leu Ala Thr Gln Ser Ile Thr
260 265 270

Asp Leu Leu Ala Cys Pro Ile Ala Asp Thr Ile Arg Glu Gln Cys Pro
275 280 285

Thr Lys Ile Phe Leu Arg Asn Asp Gly Gly Asn Leu Ser Asp Tyr Gln
290 295 300

Arg Leu Ala Asn Val Thr Glu Lys Glu Phe Glu Ile Ile Thr Lys Gly
305 310 315 320

Leu Asp Arg Lys Ile Leu Tyr Lys Gln Asp Gly Ser Pro Ser Val Ile
325 330 335

Ala Ser Phe Asn Leu Arg Gly Ile Pro Lys Glu Tyr Leu Lys Ile Leu
340 345 350

Ser Thr Asp Thr Val Phe Val Lys Glu Ile Asp Lys Ile Ile Gln Asn
355 360 365

His Ser Ile Ile Asp Lys Tyr Gln Pro
370 375

(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 154 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142 

Met Lys Leu Val Ser Leu Ile Val Ala Leu Val Phe Cys Cys Phe Leu
1 5 10 15
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Ala 


Val 


Glu 


Leu 


Pro 


Gly 


Val 


Tvr 


Gin 


Thr 


Gin 


Glu 


Phe 


Leu 


Tyr 






20 










25 










30 






Met 


Lys 


Ser 


Ser 


Phe 


Val 


Glu 


Phe 


Phe 


Glu 


Kis 


Asn 


Gly 


Lys 


Phe 


Tyr 




35 










40 










45 








Ala 


Tvr 


Gly 


He 


Ser 


Asp 


Val 


Xaa 


Xaa 


Ser 




Ala 


Lys 


Lys 


Asp 


Lys 




50 










55 










60 






Val 




Leu 


Asn 


Pro 


Asn 


Pro 


Lys 


Leu 


Arg 


Asn 


Arg 


Ser 


Asp 


Lys 


Gly 


Val 


65 










70 










75 








CjI V 


80 


Phe 


Leu 


Ser 


Asp 


Leu 


He 


Lys 


Val 


Gly 


Glu 


Gin 


Ser 


Tyr 


Lys 


biy 










8 5 










90 










95" 




Lys 


Ala 


Xaa 


Asn 


Phe 


Xaa 


Asp 


Gly 


Lys 


Thr 


Xaa 


His 


Val 


Arg 


Val 


Thr 






100 










105 










11 U 






Gin 


Xaa 


Ser 


Asn 


Gly 


Asp 


Leu 


Xaa 


Phe 


Thr 


Ser 


Ser 


Tyr 


Xaa 


Lys 


Trp 






115 








120 










125 




Glu 


Glu 


Gly 


Tyr 


Val 


Gly 


Lys 


Thr 


Phe 


Thr 


Trp 


Lys 


Arg 


Leu 


Ser 


Asp 


130 










135 










140 










He 


Lys 


Asn 


Leu 


Lys 


Leu 


Lys 


Arg 


Phe 


Asn 














14 5 










15 0 






















(2) INFORMATION FOR SEQ ID NO: 143:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143 

Leu Glu Thr Leu Phe Leu Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 114 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 114

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144 

Met Xaa Thr His Asp Arg Ara Lys Leu Arg Ile Xaa Leu Thr Gln Thr
1 5 10 15

Thr Thr Leu Val Ala Thr Ile Gly Ser Asn Ala Pro Tyr Ile Gly Leu
20 25 30

Leu Gly Thr Val Met Gly Ile Met Leu Thr Phe Met Asp Leu Gly Ser
35 40 45

Ala Ser Gly Ile Asp Thr Lys Ala Ile Met Thr Asn Leu Ala Leu Ala
50 55 60

Leu Lys Ala Thr Gly Met Gly Leu Leu Val Ala Ile Pro Ala Ile Val
65 70 75 80

Ile Tyr Asn Leu Leu Val Arg Lys Ser Glu Ile Leu Val Thr Lys Trp
85 90 95

Asp Ile Phe His His Pro Val Asp Thr Gln Ser His Glu Val Tyr Ser
100 105 110

Lys Ala

(2) INFORMATION FOR SEQ ID NO: 145:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 67 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 67

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145



15 

Val Ly< 
30 



Met 


Gin Asp Leu 


Asp 


Asn 


Asn 


Met Ser 


Leu 


Asp 


Thr 


Ala 


1 


Ser Ser Asn 


5 








10 






Leu 


Gly 


Lys 


Asn 


He Thr 


He 


Ala 


Gly 


Val 




20 








25 








Leu 


Gin Lys He 


Gly 


Val 


Ser 


Ala Lys 


Gly 


.Met 


Val 


Ser 


Ala 


35 








40 






45 


Leu Lys Lys 


Ser 


Gly 


Ala 


He Ser 


Ala 


Lys 


Trp 


Arg 




50 






55 






60 




Lys 


Gin Gin 
















65 




















(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 88

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146 

Leu His Pro Leu Ala Asp Val Phe Val Val Asn Asp Lys Arg Xaa Val
1 5 10 15

Leu Ala Met Val Xaa Met Leu Ile Xaa Ser Leu Ala Asn Ile Phe Phe
20 25 30

Asn Tyr Leu Phe Ile Phe Xaa Leu Glu Val Gly Val Gln Gly Xaa Ala
35 40 45

Ile Val Thr Val Ile Gly His Ala Ile Gly Gly Leu Val Leu Met Gln
50 55 60

His Phe Trp Arg Lys Lys Gly Glu Leu Tyr Phe Ile Lys Leu Ile Phe
65 70 75 80

Phe Ile Phe Ser His Phe Phe Ser
85

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 276

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147



Met Leu Arg Lys Asn Ile Leu Ala Tyr Tyr Gly Ala Asn Phe Leu Leu
1 5 10 15

Ile Ile Ala Gln Ser Leu Pro His Ala Ile Leu Thr Pro Leu Leu Leu
20 25 30

Ser Lys Gly Leu Ser Leu Ser Glu Ile Leu Leu Val Gln Thr Phe Phe
35 40 45

Ser Phe Cys Val Leu Val Ala Glu Tyr Pro Ser Gly Val Leu Ala Asp
50 55 60

Leu Met Ser Arg Lys Asn Leu Phe Leu Val Ser Asn Val Phe Leu Ile
65 70 75 80

Ala Ser Phe Ser Leu Val Leu Phe Phe Asp Ser Phe Ile Leu Met Leu
85 90 95

Leu Ala Trp Gly Leu Tyr Gly Leu Tyr Ser Ala Cys Ser Ser Gly Thr
100 105 110

Ile Glu Ala Ser Leu Ile Thr Asp Ile Lys Glu Asn Lys Lys Asp Leu
115 120 125

Ser Lys Phe Leu Ala Lys Asn Asn Gln Ile Thr Tyr Leu Gly Met Ile
130 135 140

Ile Gly Ser Ser Leu Gly Ser Phe Leu Tyr Leu Lys Val His Ala Met
145 150 155 160

Leu Tyr Val Val Gly Ile Phe Leu Ile Met Leu Cys Ala Leu Thr Ile
165 170 175

Ile Ile Tyr Phe Lys Glu Lys Glu Gly Asp Phe Lys Ser Gln Lys Asn
180 185 190

Leu Lys Leu Leu Lys Glu Gln Val Lys Gly Ser Leu Lys Glu Leu Lys
195 200 205

Asp Asn Pro Lys Leu Lys Ile Leu Leu Val Gly His Leu Ile Thr Pro
210 215 220

Val Phe Phe Met Ser His Phe Gln Met Trp Gln Ala Tyr Phe Leu Lys
225 230 235 240

Gln Gly Val Lys Glu Gln Tyr Leu Phe Val Phe Tyr Ile Ala Phe Gln
245 250 255

Val Ile Ser Ile Pro His Ser Phe Phe Lys Ser Gln Lys Leu Xaa Ala
260 265 270

Lys Lys Ser Pro
275

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 93 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148 



Leu Tyr Pro Pro Gly Ser Val Val Lys Met Gly Val Gly Leu Ser Phe
1 5 10 15

Leu Glu Asn Leu His Ile Thr Glu Asn Thr Thr Ile Pro Thr Pro Pro
20 25 30

Phe Ile Glu Val Gly Lys Arg Lys Phe Arg Asp Trp Lys Lys Thr Gly
35 40 45

His Gly Asn Ser Asn Leu Tyr Lys Ala Ile Arg Glu Ser Val Asp Val
50 55 60

Tyr Phe Tyr Lys Phe Gly Leu Glu Ile Ser Ile Glu Xaa Leu Ser Lys
65 70 75 80

Xaa Phe Lys Xaa Ser Gly Leu Trp Gly Lys Asn Gly Arg
85 90



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 60 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 60



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149 



Met Ala His His Xaa Glu Gln His Gly Gly His His His His His His
1 5 10 15

His Thr His His His His Tyr His Gly Gly Glu His His His His His
20 25 30

His Ser Ser His His Glu Glu Gly Cys Cys Ser Thr Ser Asp Ser His
35 40 45

His Gln Glu Glu Gly Cys Cys His Gly Xaa His Glu
50 55 60










(2) INFORMATION FOR SEQ ID NO: 150:

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 
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(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 297

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 

Leu Val Lys Ile Arg Leu Phe Asr> Phe Thr Ile Arg Leu Phe Lys Pro
1 5 10 15

Glu Phe His Ile Phe Asp Phe Leu Lys Gly Ile Arg Val Leu Met Ile
20 25 30

Glu Trp Met Gln Asn His Arg Lys Tyr Leu Val Val Thr Ile Trp Ile
35 40 45

Ser Thr Ile Ala Phe Ile Ala Ala Gly Met Ile Gly Trp Gly Gln Tyr
50 55 60

Ser Phe Ser Leu Asp Ser Asp Ser Ala Ala Lys Val Gly Gln Ile Lys
65 70 75 80

Ile Ser Gln Glu Glu Leu Ala Gln Glu Tyr Arg Arg Leu Lys Asp Ala
85 90 95

Tyr Ala Glu Ser Ile Pro Asp Phe Lys Glu Leu Thr Glu Asp Gln Ile
100 105 110

Lys Ala Met His Leu Glu Lys Ser Ala Leu Asp Ser Leu Ile Asn Gln
115 120 125

Ala Leu Leu Arg Asn Phe Ala Leu Asp Leu Gly Leu Gly Ala Thr Lys
130 135 140

Gln Glu Val Ala Lys Glu Ile Arg Lys Thr Asn Val Phe Gln Lys Asp
145 150 155 160

Gly Val Phe Asp Glu Glu Leu Tyr Lys Asn Ile Leu Lys Gln Ser His
165 170 175

Tyr Arg Pro Lys His Phe Glu Glu Ser Val Glu Arg Leu Leu Ile Leu
180 185 190

Gln Lys Ile Ser Ala Leu Phe Pro Lys Thr Thr Thr Pro Leu Glu Gln
195 200 205

Ser Ser Leu Ser Leu Trp Ala Lys Leu Gln Asp Lys Leu Asp Ile Leu
210 215 220

Ile Leu Asn Pro Asn Asp Val Lys Ile Ser Leu Asn Glu Glu Glu Met
225 230 235 240

Lys Lys Tyr Tyr Glu Asn His Arg Lys Asp Phe Lys Lys Pro Thr Ser
245 250 255

Phe Lys Thr Arg Ser Leu Tyr Phe Asp Ala Ser Leu Glu Lys Thr Asp
260 265 270

Leu Lys Glu Leu Glu Glu Tyr Tyr His Lys Asn Lys Val Ser Tyr Leu
275 280 285

Asp Xaa Xaa Gly Glu Ile Thr Gly Phe
290 295

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 90 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 

Met Val Lys His Tyr Leu Phe Met Ala Val Ser Gln Val Phe Phe Ser
1 5 10 15

Phe Phe Leu Val Leu Phe Phe Ile Ser Ser Ile Val Leu Leu Ile Ser
20 25 30

Ile Ala Ser Val Thr Leu Val Ile Lys Val Ser Phe Leu Asp Leu Val
35 40 45

Gln Leu Phe Leu Tyr Ser Leu Pro Gly Thr Ile Phe Phe Ile Leu Pro
50 55 60

Ile Thr Phe Phe Ala Ala Xaa Arg Leu Gly Xaa Ser Arg Leu Ser Tyr
65 70 75 80

Asp His Glu Leu Leu Val Phe Phe Leu Xaa
85 90














(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 86 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 86

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152












Met Ser Lys Arg Ala Ile Arg Phe Pro Asn Lys Leu Phe Ser Tyr Pro
1 5 10 15

Lys Pro Lys Ile Lys Ala Thr Asn Thr Ser His Thr Val Leu Phe Ala
20 25 30

Tyr Pro Leu Lys Pro His Glu Met Ala Leu Leu Ala Leu Ala Thr Ser
35 40 45

Leu Leu Ala Pro Ile Phe Asn Ala Ile His Ser Thr Asn Ala Leu Asn
50 55 60

Ala Ile Lys Pro Asp Gly Thr Gly Ser Lys Ile Asn Pro Ile Ile Met
65 70 75 80

Pro Met Lys Ile Gln Lys
85

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 141 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 

Val Tyr Ser Arg Phe Phe Ala Asn Gln His Glu Phe Asp Phe Glu Ala
1 5 10 15

Gln Gly Ala Leu Gly Ser Asp Gln Ser Ser Leu Asn Phe Lys Ser Thr
20 25 30

Leu Leu Gln Asp Leu Asn Gln Ser Tyr Asn Tyr Leu Ala Tyr Ser Ala
35 40 45

Thr Ala Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg Asn Ala
50 55 60

Leu Val Leu Lys Pro Ser Val Gly Val Ser Tyr Asn His Leu Gly Ser
65 70 75 80

Thr Asn Phe Lys Ser Asn Ser Gln Ser Gln Val Ala Leu Lys Asn Gly
85 90 95

Ala Ser Ser Gln His Leu Phe Asn Ala Asn Ala Thr Trp Lys Arg Val
100 105 110

Ile Ile Met Gly Thr Leu His Thr Phe Ile Cys Met Trp Glu Phe Tyr
115 120 125

Lys Ser Ser Leu Thr Leu Asp Arg Met Met Trp Arg Leu
130 135 140









(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 185 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 



Met Xaa Glu Asn Gly Arg Gly Val Pro Lys Asp Tyr Lys Lys Ala Val
1 5 10 15

Glu Tyr Phe Gln Lys Ala Val Asp Asn Asp Ile Pro Arg Gly Tyr Asn
20 25 30

Asn Leu Gly Val Met Tyr Lys Glu Gly Lys Gly Val Pro Lys Asp Glu
35 40 45

Lys Lys Ala Val Glu Tyr Phe Arg Ile Ala Thr Glu Lys Gly Tyr Thr
50 55 60

Asn Ala Tyr Ile Asn Leu Gly Ile Met Tyr Met Glu Gly Arg Gly Val
65 70 75 80

Pro Ser Asn Tyr Ala Lys Ala Thr Glu Cys Phe Arg Lys Ala Met His
85 90 95

Lys Gly Asn Val Xaa Ala Tyr Ile Leu Leu Gly Asp Ile Tyr Tyr Ser
100 105 110

Gly Met Ile Asn Trp Val Leu Ser Arg Thr Lys Ile Arg Leu Val His
115 120 125

Tyr Lys Met Ala Ala Asp Val Ser Ser Ser Arg Ala Tyr Xaa Gly Leu
130 135 140

Ser Glu Ser Tyr Xaa Tyr Gly Leu Gly Val Glu Lys Xaa Xaa Lys Lys
145 150 155 160

Ala Glu Glu Tyr Met Gln Lys Ala Cys Asp Phe Asp Ile Asp Lys Asn
165 170 175

Cys Lys Lys Lys Asn Thr Ser Ser Arg
180 185
















(2) INFORMATION FOR SEQ ID NO: 155:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 139 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 139

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155



Leu 


Leu 


Asn 


Met 


Trp 


Asp 


Glu 


Ala 


Lys 


Lys 


Glu 


Gly 


He 




T i ^ 


Asn 


1 








5 










10 










15 




Thr Glu Lys Leu Ser Gln Glu Leu Gly Val Val Cys Val Pro Thr Ser
20 25 30

Ala Arg Xaa Lys Glu Asp Arg Leu Asn Thr Glu Leu Leu Leu Asp Glu
35 40 45

Ile Val Arg Leu Tyr Ser Gln Asn Thr Thr Asn Asn Glu Asn Ile Lys
50 55 60

Val Pro Ser Gln Ser Phe Lys Glu Ser Leu Lys Tyr Ser Gln Ser Ala
65 70 75 80

Gln Arg Ile Ala Lys Ser Val Ile Ser Glu Asn Lys Gln Asn Ala Ser
85 90 95

Phe Glu His Thr Tyr Lys Ile Asp Lys Ile Phe Asn Ala Pro Ala Leu
100 105 110

Trp Asp Phe His Phe Phe Xaa Val Tyr Val Tyr His Leu Phe Phe Glu
115 120 125

Leu Phe Asn Arg Arg Gly Ser Ala Lys Ser Pro
130 135



(2) INFORMATION FOR SEQ ID NO: 156:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 193



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 



Met Gln Glu Ala Leu Leu Arg Phe Gln Glu Gly Phe Lys Glu Trp Gly
1 5 10 15

Tyr Leu Ile Leu Phe Leu Tyr Ser Leu Gly Gly Gly Tyr Val Gly Ile
20 25 30

Val Ile Ala Ser Ile Leu Ser Ala Thr Thr His Ala Leu Asp Ile Lys
35 40 45

Ile Thr Ile Leu Val Ala Phe Leu Gly Asn Leu Ile Gly Ser Gly Ala
50 55 60

Leu Val Ile Phe Ala Arg Tyr Gln Lys Arg Glu Phe Leu Lys Tyr Phe
65 70 75 80

Gln Lys His Arg Arg Lys Leu Ala Leu Ala Ser Leu Trp Val Lys Arg
85 90 95

Tyr Ala Leu Leu Met Ile Phe Val Asn Lys Tyr Leu Tyr Gly Ile Lys
100 105 110

Ser Val Val Pro Leu Ala Ile Gly Phe Ser Lys Tyr Pro Leu Lys Lys
115 120 125

Phe Leu Trp Leu Asn Val Phe Ser Ser Phe Leu Trp Ala Leu Ile Val
130 135 140

Gly Ser Val Ser Phe Gln Ala Ser Asp Trp Val Lys Thr Leu Tyr Glu
145 150 155 160

Arg Leu Ser His Tyr Thr Ser Phe Phe Val Ile Ser Phe Val Leu Ile
165 170 175

Ala Leu Leu Ile Trp Phe Leu Leu Lys Arg Tyr Ser Arg Lys Met Gly
180 185 190

Phe



(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 129 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 129



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157

Met Arg Lys Gly Arg Val Met Leu Cys Val Phe Asp Ile Glu Thr Ile
1 5 10 15

Pro Asn Ile Ser Leu Cys Lys Glu His Phe Gln Leu Lys Glu Asp Asp
20 25 30

Ala Leu Lys Ile Cys Glu Trp Ser Phe Glu Lys Gln Lys Glu Lys Ser
35 40 45

Gly Ser Glu Phe Leu Pro Leu Tyr Leu His Glu Ile Ile Ser Ile Ala
50 55 60

Ala Val Ile Gly Asp Asp Tyr Gly Gln Phe Ile Lys Val Gly Asn Phe
65 70 75 80

Gly Gln Lys His Glu Asn Lys Glu Asp Phe Ala Ser Glu Lys Glu Leu
85 90 95

Leu Glu Asp Phe Phe Lys Tyr Phe Asn Glu Lys Gln Pro Arg Leu Ile
100 105 110

Ser Phe Xaa Gly Arg Gly Phe Gly Tyr Ser Pro Thr His Ala Gln Ser
115 120 125

Pro



(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 307 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 307 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

Met Ala Lys Lys Lys Ile Ala Ile Ser Cys Gly Asp Ile Gln Gly Val
1 5 10 15

Gly Leu Glu Leu Ile Leu Lys Ser His Lys Glu Val Ser Ala Leu Cys
20 25 30

Glu Pro Leu Tyr Leu Val His Ser Glu Leu Leu Glu Arg Ala Asn Gln
35 40 45

Leu Leu Asp Asn Ala Tyr Glu Thr Lys Thr Leu Asn Ala Ile Ala Ile
50 55 60

Asp Ala Pro Leu Pro Leu Leu Asn Ser Ser Thr Ile Gly Lys Val Ser
65 70 75 80


Thr 


Gin 


Ser 


Glv 


Ala 


* J - 


Ser 


Phe 


Glu 


Ser 


Phe 


Lys 


Lys 


Ala 


Cys 


Glu 










35 










90 










95 




L 6U 


Ala 


Asp 


Ser 




Gl U 


Val 


Asp 


Glv 

vj x y 


He 


Cvs 


Thr 


Leu 


Pro 


He 


Asn 








100 








105 










110 






Ly s 


Leu 


Ala 




Gin 


Gin 


Ala 


Gin 


He 


Pro 


Phe 


Val 


Glv 


His 


Thr 


Asd 




115 










120 










125 








Phe 


Leu 


Lys 


Gin 


y 


Tvr 


Lys 


Asp 


His 


Gin 


He 


lie 


Met 


Met 


Leu 


Gly 




13 0 










13 5 










140 










L.yi=> 


Ser 


Lys 


Leu 


Phe 


Val 


Glv 

\j ± y 


Leu 


Phe 


Ser 


Asp 


His 


Val 


Pro 


Leu 


Ser 


X H Zj 








150 










155 










160 


r\ X d 


Val 


«_> t; X 


Gin 


Leu 


He 


Gin 


Val 


Lys 


Ala 


Leu 


Val 


Lys 


Phe 


Leu 


Leu 










X O _J 










17 0 










175 




Xd 








OCX 


Thx 




Ala 


Lys 




Vol 


Giii 


Val 


\_ y o 


\J X V 


Piie 








180 










185 










190 






Asn 


O T~ 


n x o 


x a 


C ~\ \r 

Ksiy 




Glu 


ni v 

o iy 


Leu 


Phe 


Gly 


Glu 


Glu 


Asp 


Glu 


Lys 






X j D 










200 










205 








lie 


Leu 


Lys 


Ala 


lie 


Pin 


O X LA 




Asn 


Gin 


Thr 


Leu 


Gly 


Phe 


Glu 






210 








215 










220 










Phe Leu Gly Pro Leu Pro Ala Asp Ser Ala Phe Ala Pro Asn Lys Arg
225 230 235 240

Lys Ile Thr Pro Phe Tyr Val Ser Met Ser His Asp Val Gly Leu Ala
245 250 255

Pro Leu Lys Ala Leu Tyr Phe Asp Glu Ser Ile Asn Val Ser Leu Asn
260 265 270

Ala Pro Ile Leu Arg Ala Ser Thr Asp His Gly Thr Ala Phe Asp Ile
275 280 285

Ala Tyr Gln Asn Lys Ala Asn His Lys Ser Tyr Leu Asn Ala Ile Lys
290 295 300

Tyr Leu Ala
305



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...146



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 



Met Ser Ser Gly Leu Ile Tyr Ile Ser Leu Glu Val Leu Val Xaa Cys
1 5 10 15

Leu Ile Thr Ala Leu Ile Met Tyr Tyr Val Met Lys Lys Ile Tyr Tyr
20 25 30

Ala Arg Gly Gln Ala Ile Leu Lys Gly Ala Ser Ala Lys Ala Lys Leu
35 40 45

Met Glu Phe Gln Ala Lys Ser Phe Val Glu Ala Glu Glu Met Arg Met
50 55 60

Lys Ser Gln Glu Cys Lys Leu Gln Gln Gln Tyr Glu Asn Lys Asn Leu
65 70 75 80

Gln Leu Gln Thr His Phe Asp Lys Lys Glu Ala His Leu Lys His Leu
85 90 95

Glu Ala Gln His Lys Glu Phe Val Arg Asp Glu Lys Arg Tyr Leu Glu
100 105 110

Lys Glu Lys Lys Glu Leu Glu Lys Glu Arg Gln Ile Leu Glu Xaa Glu
115 120 125

Arg Glu Asn Phe Xaa Xaa Gln Arg Ala Phe Val Xaa Xaa Xaa Xaa Ala
130 135 140

Lys Ala
145

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1...127

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 



Met Asn Ile Lys Ile Leu Lys Ile Leu Val Gly Gly Leu Phe Phe Leu
1 5 10 15

Ser Leu Asn Ala His Leu Trp Gly Lys Gln Asp Asn Ser Phe Leu Gly
20 25 30

Ile Gly Glu Arg Ala Tyr Lys Ser Gly Asn Tyr Ser Lys Ala Ala Ser
35 40 45

Tyr Phe Lys Lys Ala Cys Asn Asp Gly Val Ser Glu Gly Cys Thr Gln
50 55 60

Leu Gly Ile Ile Tyr Glu Asn Gly Gln Gly Thr Arg Ile Asp Tyr Lys
65 70 75 80

Lys Ala Leu Glu Tyr Tyr Lys Thr Ala Cys Gln Ala Asp Asp Arg Glu
85 90 95

Gly Cys Phe Gly Leu Gly Gly Leu Tyr Asp Glu Gly Leu Gly Thr Ala
100 105 110

Gln Asn Tyr Gln Glu Ala Xaa Asp Ala Tyr Ala Arg His Ala Phe
115 120 125









(2) INFORMATION FOR SEQ ID NO: 161:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 116 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...116

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 

Met Ala Ile Ala Ile Lys Asp Leu Leu Ser Ala Tyr Lys Val Val Leu
1 5 10 15

Pro Leu Asp Lys Ile Ser Met Pro Ser Ser Ala Asp Leu Lys Leu Thr
20 25 30

Leu Gln Phe Leu Lys Asn Thr Ala Pro Leu Phe Ser Val Gln Gly Ser
35 40 45

Val Asn Leu Gln Glu Gly Thr Phe Ser Leu Tyr Asn Ile Pro Leu Tyr
50 55 60

Thr Gln Ser Ala Gln Ile Asn Leu Asp Ile Ala Gln Glu Tyr Gln Tyr
65 70 75 80

Ile Tyr Ile Asp Thr Ile His Thr Arg Tyr Ala Asn Met Xaa Asp Leu
85 90 95

Asp Ala Lys Ile Ala Leu Asp Leu Gly Gln Lys Asn Leu Ser Xaa Xaa
100 105 110

Xaa Leu Gly Pro
115

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 82 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...82



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 



Met Pro Asp Asn Leu His Leu His Thr Leu Leu Xaa Lys Phe Leu Gln
1 5 10 15

Gln Arg Ser Phe Asn Tyr Pro Asn Pro Leu Cys Ala Phe Ile Leu Ile
20 25 30

Leu Cys Asn Leu Pro Phe Ile Leu Ile Ser Val Leu Phe Arg Leu Asp
35 40 45

Ala Tyr Ala Leu Ile Val Ile Ser Leu Val Phe Ile Xaa Cys Tyr Leu
50 55 60

Ile Gly Xaa Ala Tyr Leu Asn Arg Gln Val Cys Ala Leu Glu Lys Arg
65 70 75 80

Ala Phe



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...116

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

Leu Lys Val Thr Asn Pro His Leu Leu Val Val Ile Gln Asp Leu Asn
1 5 10 15

Ala Arg Ile Ala Leu Met Lys Leu Leu Phe Gln Asn Val Lys Ser Ala






20 25 30

Asn Lys Glu Leu Val Phe Cys Asn Lys Glu Lys Arg Leu Ile Arg Ser
35 40 45

Phe Asp Ala Gln Lys Glu Tyr Gly Ile Thr Pro Val Glu Asn Ile Leu
50 55 60

Ser Val Leu Asp Thr Ala Met Asn Pro Asn Ser Ala Leu Val Ile Asp
65 70 75 80

Asn Leu Asn Glu Ala Lys Glu Leu His Asp Lys Val Gly Ala Glu Lys
85 90 95

Leu Lys Ser Phe Leu Glu Lys Ala Xaa Arg Gln Arg Ala Val Leu Arg
100 105 110

His Phe Cys Ala
115



























(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 198 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...198



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 



Met Lys Glu Ser Phe Tyr Ile Glu Gly Met Thr Cys Thr Ala Cys Ser
1 5 10 15

Ser Gly Ile Glu Arg Ser Leu Gly Arg Lys Ser Phe Val Lys Lys Ile
20 25 30

Glu Val Ser Leu Leu Asn Lys Ser Ala Asn Ile Glu Phe Asn Glu Asn
35 40 45

Glu Thr Asn Leu Asp Glu Ile Phe Lys Leu Ile Glu Lys Leu Gly Tyr
50 55 60

Ser Pro Lys Lys Thr Leu Ala Glu Glu Lys Lys Glu Phe Phe Ser Pro
65 70 75 80

Asn Val Lys Leu Ala Leu Ala Val Ile Phe Thr Leu Phe Val Val Tyr
85 90 95

Leu Ser Met Gly Ala Met Leu Ser Pro Ser Leu Leu Pro Glu Ser Leu
100 105 110

Leu Thr Ile Asn His His Ser Asn Phe Leu Asn Ala Cys Leu Gln Leu
115 120 125

Ile Gly Ala Leu Ile Val Met His Leu Gly Arg Asp Phe Tyr Ile Gln
130 135 140

Gly Phe Lys Ala Leu Trp His Arg Gln Pro Asn Met Ser Ser Leu Ile
145 150 155 160

Ala Ile Gly Thr Ser Ala Ala Leu Ile Ser Ala Cys Gly Asn Cys Ile
165 170 175

Trp Phe Ile Pro Ile Ile Ile Pro Ile Ser Gly Leu Met Gly Ile Ile
180 185 190

Ile Leu Lys Ala Cys Ala
195























(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...85



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 



Val Gly Ile Val Pro Asp Asn Leu Trp Lys Leu Lys Arg Phe Asn Gln
1 5 10 15

Asp Trp Arg Val Gly Asp Thr Leu Ile Thr Ala Ile Gly Gln Gly Ser
20 25 30

Phe Leu Ala Thr Pro Leu Gln Val Leu Ala Tyr Thr Gly Leu Ile Ala
35 40 45

Thr Gly Lys Leu Ala Thr Pro His Phe Ala Ile His Asn Gln Gln Pro
50 55 60

Leu Lys Asp Pro Leu Asn Arg Phe Ser Lys Lys Glu Ala Pro Ser Leu
65 70 75 80

Ala Arg Gly His Val
85






















(2) INFORMATION FOR SEQ ID NO: 166:

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...343



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 



Met Gln Asn Leu Leu Ile Gln Ala Glu Asn Ala Ile Ala Leu Leu Phe
1 5 10 15

Leu Leu Asn Asp Lys Asn Leu Lys Gly Lys Ile Asp Leu Ile Tyr Ile
20 25 30

Asp Pro Pro Phe Ala Thr Asn Asn His Phe Thr Ile Thr Asn Gly Arg
35 40 45

Ala Thr Thr Ile Ser Asn Ser Lys Asn Gly Asp Ile Ala Tyr Ser Asp
50 55 60

Lys Val Val Gly Met Asp Phe Met Glu Phe Leu Lys Gln Arg Leu Val
65 70 75 80

Leu Leu Lys Glu Leu Leu Ser Glu Gln Gly Ser Ile Tyr Val His Thr
85 90 95

Asp Tyr Lys Ile Gly His Tyr Val Lys Val Met Leu Asp Glu Ile Phe
100 105 110

Gly Ile Gln Asn Phe Arg Asn Glu Ile Thr Arg Ile Lys Cys Asn Pro
115 120 125

Lys Asn Phe Lys Arg Ile Gly Tyr Gly Asn Ile Lys Asp Met Ile Leu
130 135 140

Phe Tyr Ser Lys Gly Lys Asn Pro Ile Phe Asn Glu Pro Lys Ile Pro
145 150 155 160

Tyr Thr Pro Gln Asp Leu Glu Lys Arg Phe Pro Lys Ile Asp Lys Asp
165 170 175

Lys Arg Arg Tyr Thr Thr Val Pro Ile His Ala Pro Gly Glu Val Glu
180 185 190

Ser Gly Glu Cys Ser Lys Ala Phe Lys Gly Met Leu Pro Pro Lys Gly
195 200 205

Arg His Trp Arg Thr Asp Ile Ala Thr Leu Glu Arg Trp Asp Lys Glu
210 215 220

Gly Leu Ile Glu Tyr Ser Asn Asn Asn Asn Pro Arg Lys Lys Ile Tyr
225 230 235 240

Ala Leu Glu Gln Val Gly Lys Arg Val Gln Asp Ile Trp Glu Phe Lys
245 250 255

Asp Pro Gln Tyr Pro Ser Tyr Pro Thr Glu Lys Asn Ala Gln Leu Leu
260 265 270

Asp Leu Ile Ile Lys Thr Ser Ser Asn Lys Asp Ser Ile Val Leu Asp
275 280 285

Cys Phe Cys Gly Ser Gly Thr Thr Leu Lys Ser Ala Phe Leu Leu Gln
290 295 300

Arg Lys Phe Ile Gly Ile Asp Asn Ser Asp Leu Ala Ile Gln Ala Cys
305 310 315 320

Lys Asn Lys Leu Glu Thr Ile Thr Lys Asp Leu Phe Val Ser Gln Asn
325 330 335

Phe Tyr Asp Phe Leu Val Phe
340
























(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 176 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...176

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167










Met Thr Ser Val Val Ile Lys Pro His Ala Tyr Gly Glu Gln Val Gln
1 5 10 15

Glu Ile Glu Glu Glu Ser Asp Ser Asp Tyr Glu Lys Asn Asn Asp Gln
20 25 30

Glu Ala Ile Asn Phe Gly Ile Ala Leu His Lys Gly Leu Glu Tyr Gln
35 40 45

Tyr Ala Tyr Asn Ile Pro Lys Gln Ser Val Leu Glu Tyr Leu Asn Tyr
50 55 60

His Tyr Gly Phe Tyr Gly Leu Asp Tyr Gln Ala Leu Glu Glu Ser Leu
65 70 75 80

Glu Leu Phe Glu Asn Asp Ala Gly Ile Gln Ala Leu Phe Lys Asn His
85 90 95

Ala Leu Lys Gly Glu Ala Ala Phe Leu Phe Gln Gly Val Val Ser Arg
100 105 110

Ile Asp Val Leu Leu Trp Asp Arg Gly Gln Asn Leu Tyr Val Leu Asp
115 120 125

Tyr Lys Ser Ser Gln Asn Tyr Gln Gln Ser His Lys Ala Gln Val Ser
130 135 140

His Tyr Ala Glu Phe Leu Arg Thr Gln Xaa Pro His Phe Lys Ile Gln
145 150 155 160

Ala Gly Ile Ile Tyr Ala His Lys Arg Leu Leu Glu Lys Xaa Trp Xaa
165 170 175




(2) INFORMATION FOR SEQ ID NO: 168:

(i) SEQUENCE CHARACTERISTICS:

















(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...260

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168



iufcr L- 


Ser 


G lu 


As p 


Leu 


Prn 

Sr L U 


• lit; 


rt x d 


Cqv- 


Asp 


CTo r- 


r 1 n 


rile 


i Hi 


Tyr 


Asn 


1 

X 








D 










x u 










l d 






V = " 


c. <r. v- 


— {_ 




A - . 


p 




n5u 


G j. u 


* , -, 
v a _l 




Asp 


Vai 


lit: 


Sex 




























o U 






Kj ly 


vol 


Asn 


lie 


i n r 


Leu 


blu 


bin 


1 LIT 


Tnr 


bill 


Pro 


Asn 


Lys 


Pro 


Ala 
















u 










4 b 








x x e 


x x e 


oer 


val 


Ser 


Arg 


Asp 


Asn 




Ala 


lie 


lie 


Asp 


Ser 


Leu 


Lys 




t; n 
b U 










b b 










6 0 








biU 


rne 


Val 


Lys 


Ala 


Tyr 


Asn 


Glu 


Leu 


He 


Pro 


Lys 


Leu 


Asp 


Glu 


Asp 


65 










70 










75 










80 


Thr Arg Tyr Asp Ala Asp Thr Lys Ile Ala Gly Ile Phe Asn Gly Val
85 90 95

Gly Asp Ile Arg Ala Ile Arg Ser Ser Leu Asn Asn Val Phe Ser Tyr
100 105 110

Ser Val His Thr Asp Asn Gly Val Glu Ser Leu Met Lys Tyr Gly Leu
115 120 125

Ser Leu Asp Asp Lys Gly Val Met Ser Leu Asp Glu Ala Lys Leu Ser
130 135 140

Ser Ala Leu Asn Ser Asn Pro Lys Ala Thr Gln Asp Phe Phe Tyr Gly
145 150 155 160

Ser Asp Ser Lys Asp Met Gly Gly Arg Glu Ile His Gln Glu Gly Ile
165 170 175

Phe Ser Lys Phe Asn Gln Val Ile Ala Asn Leu Ile Asp Gly Gly Asn
180 185 190

Ala Lys Leu Lys Ile Tyr Glu Asp Ser Leu Asp Arg Asp Ala Lys Ser
195 200 205

Leu Thr Lys Asp Lys Glu Asn Ala Gln Glu Leu Leu Lys Thr Arg Tyr
210 215 220

Asn Ile Met Ala Glu Arg Phe Ala Ala Tyr Asp Ser Gln Ile Ser Lys
225 230 235 240

Ala Asn Gln Lys Phe Asn Ser Val Gln Met Met Ile Asp Gln Ala Ala
245 250 255

Ala Lys Lys Asn
260


























(2) INFORMATION FOR SEQ ID NO: 169:

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...145

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169

Met Arg Ile Val Phe Met Gly Thr Pro Ser Phe Ala Glu Val Ile Leu
1 5 10 15

Arg Ala Leu Val Glu Asn Glu Asp Lys Lys Ile Glu Val Val Gly Leu
20 25 30

Phe Thr Gln Arg Asp Lys Pro Phe Gly Arg Lys Lys Glu Leu Lys Ala
35 40 45

Pro Glu Thr Lys Thr Tyr Ile Leu Glu Asn His Leu Asn Ile Pro Ile
50 55 60

Phe Gln Pro Gln Ser Leu Lys Glu Pro Glu Val Gln Ile Leu Lys Gly
65 70 75 80

Leu Lys Pro Asp Phe Ile Val Val Val Ala Tyr Gly Lys Ile Leu Pro
85 90 95

Lys Glu Val Leu Thr Ile Ala Pro Cys Ile Asn Leu His Ala Ser Leu
100 105 110

Leu Pro Lys Tyr Arg Gly Ala Ser Pro Ile His Glu Met Ile Leu Asn
115 120 125



... ' T — ml w „ k. t... . 

~>^j — i.iu ncu jjcu lutrL aqo riic biy lie 

130 135 140 

Gly 
145 

(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 248 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...248

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 



Met Arg Phe Tyr Phe Lys Phe Leu Trp Leu Leu Gly Ile Phe Leu Ile
1 5 10 15

Phe Tyr Phe Leu Asp Ile Lys Gly Ser Ser Ser Tyr Ile Ser Asp Arg
20 25 30

Val Lys Ser Ala Leu Met Ser Ala Lys Asn Ser Leu Leu Asp Asn Val
35 40 45

Gln Ala Tyr Phe Phe Gln Ala Gln Asn Ile Lys Glu Phe Gln Lys Glu
50 55 60

Arg Leu Ile Leu Glu Ala Leu Lys Leu Glu Asn Ala Asp Leu Lys Glu
65 70 75 80

Arg Leu Asn Ser Ile Tyr Pro Leu Glu Asn Pro Lys Met Thr Tyr Thr
85 90 95

Pro Thr Phe Met Thr Ser Phe Ile Asn Leu Glu Asp Thr His Ser Val
100 105 110

Ser Leu Asn Pro Ile Val Asn Leu Glu Glu Asn Lys Ile Tyr Gly Leu
115 120 125

Val Ser His Asn Gln Ala Ile Gly Ile Ala Val Leu Glu Lys Gly Arg
130 135 140

Leu Asn Gly Phe Leu Asn Ala His Lys Arg Cys Ala Tyr Ser Val Met
145 150 155 160

Ile Gly Gln Asn Gln Val Leu Gly Phe Ile Gly Thr Asn Phe Lys Gln
165 170 175

Glu Leu Val Val Asp Phe Ile Val Pro Ser Ala Glu Ile Asn Ile Gly
180 185 190

Asp Gln Val Leu Thr Ser Gly Leu Asp Gly Ile Phe Gly Ala Gly Val
195 200 205

Phe Val Gly Glu Val Ser Ser Val Glu Asp His Tyr Thr Tyr Lys Ser
210 215 220

Ala Val Leu Lys Asn Ala Phe Leu Ser Glu Ala Lys Leu Leu Arg His
225 230 235 240

Val Phe Leu Ser Gly Val Lys Asn
245

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...119



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 



Leu Ala Leu Arg Leu Pro Phe Leu Ile Ala His Val Ile Asn Met Phe
1 5 10 15

Leu Phe Tyr Leu Ile Gly Arg Lys Ile Leu Lys Lys Pro Lys Asp Ala
20 25 30

Leu Tyr Val Val Leu Thr Tyr Ala Leu Leu Pro Gly Val Asn Leu Phe
35 40 45

Ala Ile Leu Leu Ala Lys Ser Val Leu Val Leu Ser Leu Gly Leu Leu
50 55 60

Ile Ser Tyr Leu Tyr Ile Lys Thr Gln Lys Ile Pro Tyr Leu Thr Leu
65 70 75 80

Ser Ala Cys Ala Phe Leu Asp Gly Ala Phe Ile Pro Leu Leu Leu Gly
85 90 95

Val Phe Ala Tyr Ala Leu Arg Lys Thr Ala Ile Leu Arg Ala Arg Ser
100 105 110

Leu Leu Trp Trp Phe Xaa Leu
115



(2) INFORMATION FOR SEQ ID NO: 172:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...108

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

Val Asn Leu Met Asp Tyr Phe Ser Lys Ser Leu Phe Leu Asn Ser Leu
1 5 10 15

Asn Thr Gln Arg Leu Ile Val Ser Asn Lys Leu Ala Ile Asp Val Gln
20 25 30

Tyr Gly Met Leu Gln Ser Val Arg Lys Asn Tyr Pro Asp Val Val Asp
35 40 45

Gly Gly Val Arg Glu Gly Pro Phe Trp Val Leu Ala Gly Ala Leu Met
50 55 60

Pro Ser Ile Leu Ile Glu Ile Gly Tyr Asn Ser His Ala Ile Glu Ser
65 70 75 80

Lys Arg Ile Gln Ser Lys Pro Tyr Gln Lys Ile Leu Ala Lys Gly Ile
85 90 95

Ala Asp Gly Ile Asp Ser Phe Phe Ser Lys Asn Asp
100 105
















(2) INFORMATION FOR SEQ ID NO: 173:

















(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...157

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173



Leu Ala Ser Arg Tyr Ser Val Ala Val Gly Asn Leu Phe Ser Glu His
1 5 10 15

Leu Tyr Asp Leu Arg Asn Glu Thr Met Thr Asn Leu Ile Gly Phe Leu
20 25 30

Leu Val Leu Ala Ser Ile Trp Val Phe Phe Leu Ala Leu Gly Val Leu
35 40 45

Leu Gly Lys Met Leu Val Phe Ser Gly Leu Gly Ile Ile Asp Lys Ala
50 55 60

Leu Gly Phe Ile Phe Ser Cys Leu Lys Thr Phe Leu Val Leu Ser Phe
65 70 75 80

Ile Leu Tyr Ala Leu Ser Lys Met Asp Leu Met Lys Asp Ala Asn Ala
85 90 95

Tyr Leu Gln Glu Lys Xaa Xaa Ile Phe Pro Thr Xaa Lys Xaa Xaa Xaa
100 105 110

Ser Lys Ile Met Arg Leu Asp Gly Val Lys His Val Glu Lys Asn Leu
115 120 125

Lys Asp Asn Leu Glu Glu Met Ser Asp Glu Val Lys Asn Lys Gly Ser
130 135 140

Ile Asp Asn Ala Lys Glu Ser Phe Asn Lys Gly Tyr Gly
145 150 155












(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...81

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174



Leu Ser Lys Gln Ser Ala Asp Ile Val Ile Thr Asn Asp Ser Leu Ser
1 5 10 15

Ser Leu Val Lys Val Leu Ala Ile Ala Lys Lys Thr Lys Ser Ile Thr
20 25 30

Trp Gln Asn Ile Leu Phe Ala Leu Gly Ile Lys Ala Val Phe Ile Val
35 40 45

Leu Gly Leu Met Gly Val Ala Ser Leu Trp Glu Ala Val Phe Gly Asp
50 55 60

Val Gly Val Thr Leu Leu Ala Leu Ala Asn Ser Xaa Arg Thr Met Arg
65 70 75 80

Ala



(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...80

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175



Met Lys Asn Leu Arg His Phe Arg Lys Leu Ile Ala Phe Leu Gly Phe
1 5 10 15

Ser Pro Leu Leu Leu Gln Ala Asp Met Thr Thr Phe Phe Asn Ser Ile
20 25 30

Glu Gln Gln Leu Thr Ser Pro Thr Ala Lys Gly Ile Leu Met Val Ile
35 40 45

Phe Leu Gly Leu Ala Ile Phe Ile Trp Lys Asn Leu Asp Arg Trp Lys
50 55 60

Glu Ile Leu Met Thr Val Leu Ala Leu Lys Xaa Val Pro Met Gln Xaa
65 70 75 80


(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...325

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176

Leu Ala Gly Leu Xaa Val Gly Cys Xaa Arg Met Lys Gln Thr Phe Trp
1               5                   10                  15

Xaa Leu Ser Trp Gly Glu Lys Ser Gln Lys Val Cys Val His Arg Pro
            20                  25                  30

Trp


Tyr 


Ala 


lie 


Trp 


Ser 


Cys 


Asp 


Lys 


Trp 


Glu 


Glu 


Lys 


Thr 


Gin 


Gin 






35










40 










45 








Phe 


Thr 


Gly 


Asn 


Gin 


Leu 


lie 


Thr 


Lys 


Thr 


Trp 


Ala 


Gly 


Gly 


Asn 


Ala 




5 0 










55 










60 










Asn 


Tyr 


Tyr 


His 


Ser 


Gin 


Asn 


Asn 


Gin 


Asp 


lie 


Thr 


Ala 


Asn 


Leu 


6 5 










70 










75 










80 


Lys 


Asn 


Asp 


Asn 


Gly 


Thr 


Tyr 


Phe 


Leu 


Ser 


Gly 


Leu 


Tyr 


Asn 


Tyr 


Thr 










8 5 










90 










95 




u i y 


u j.y 


o i u 


i yr 


Asn 


r l w 
ir±y 


u ly 


Asn 


Leu 


Asp 


1 le 


Glu 


Leu 


Gly 


Ser 


Asn 








1 u u 










105 










110 






Ala 


Thr 


Phe 


Asn 


Leu 


Gly 


Ala 


Ser 


Ser 


Gly 


Asn 


Ser 


Phe 


Thr 


Ser 


Trp 






115 










120 










125 






Tyr 


Pro 


Asn 


Gly 


His 


Thr 


Asp 


Val 


Thr 


Phe 


Ser 


Ala 


Gly 


Thr 


He 


Asn 




13 0 










13 5 










140 








Val 


Asn 


Asn 


Ser 


Val 


Glu 


Val 


Gly 


Asn 


Arg 


Val 


Gly 


Ser 


Gly 


Ala 


Gly 


14 5 










150 










155 








160 


Thr 


His 


Thr 


Gly 


Thr 


Ala 


Thr 


Leu 


Asn 


Leu 


Asn 


Ala 


Asn 


Lys 


Val 


Thr 










165 










170 










175 




lie 


Asn 


Ser 


Asn 


lie 


Ser 


Ala 


Tyr 


Lys 


Thr 


Ser 


Gin 


Val 


Asn 


Val 


Gly 








180 










185 










190 




Asn 


Ala 


Asn 


Ser 


Val 


Ile


Thr 


lie 


Asn 


Ser 


Val 


Ser 


Leu 


Asn 


Gly 


Glu 






195 










200 










205 






Thr 


Cys 


Xaa 


Ser 


Leu 


Ala 


Arg 


Val 


Gly 


Val 


Gly 


Ala 


Asn 


Cys 


Ser 


Thr 




210 










215 










220 








Ser 


Gly


Pro 


Ser 


Tyr 


Ser 


Phe 


Lys 


Gly 


Thr 


Thr 


Asn 


Ala 


Thr 


Asn 


Thr 


225 










230 










235 










240 


Thr 


Phe 


Ser 


Xaa 


Ser 


Ser 


Gly 


Ser 


Phe 


Thr 


Phe 


Glu 


Glu 


Asn 


Ala 


Thr 










245 










250 










255 




Phe 


Ser 


Gly 


Ala 


Lys 


Leu 


Asn 


Gly 


Gly 


Ala 


Phe 


Thr 


Phe 


Asn 


Lys 


Lys 








260 










265 










270 


Phe 


Asn 


Ala 


Thr 


Asn 


Asn 


Thr 


Ala 


Phe 


Asn 


Ser 


Gly 


Ser 


Phe 


Thr 


Phe 






275 










280 








285 








Lys 


Gly 


Thr 


Ser 


Ser 


Phe 


Asn 


Gly 


Ala 


Asn 


Phe 


Ser 


Asn 


Ala 


Ser 


Tyr 




290 










295 










300 








Thr 


Phe 


Asn 


Asn 


Gin 


Ala 


Thr 


Phe 


Gin 


Asn 


Ser 


Ser 


Phe 


Asn 


Gly 


Gly 


305 










310 










315 










320 


Thr 


Phe 


Thr 


Phe 


Asn 

























325 



(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 271 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...271



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 



Leu Leu Ser Leu Val Lys Gly Lys Thr Met Leu Arg Ser Leu Tyr Ser
1               5                   10                  15

Ala Thr Ser Gly Met Leu Ala Gln Gln Thr His Ile Asp Thr Thr Ser
            20                  25                  30

Asn Asn Ile Ala Asn Val Asn Thr Thr Gly Phe Lys Lys Ser Arg Ala
        35                  40                  45

Asp Phe Asn Asp Leu Phe Tyr Gln Ala Met Gln Tyr Ala Gly Thr Asn
    50                  55                  60

Thr Ser Asn Thr Thr Leu Ser Pro Asp Gly Met Glu Val Gly Leu Gly
65                  70                  75                  80

Val Arg Pro Ser Ala Ile Thr Lys Met Phe Ser Gln Gly Ser Pro Lys
                85                  90                  95

Glu Thr Glu Asn Asn Leu Asp Ile Ala Ile Thr Gly Lys Gly Phe Phe
            100                 105                 110

Gln Val Gln Leu Pro Asp Gly Thr Thr Ala Tyr Thr Arg Ser Gly Asn
        115                 120                 125

Phe Lys Leu Asp Glu Gln Gly Asn Leu Val Thr Ser Glu Gly Tyr Leu
    130                 135                 140

Leu Ile Pro Gln Ile Thr Leu Pro Glu Asp Thr Thr Gln Val Asn Ile
145                 150                 155                 160

Gly Val Asp Gly Thr Val Ser Val Thr Gln Gly Leu Gln Thr Thr Ser
                165                 170                 175

Asn Val Ile Gly Gln Ile Thr Leu Ala Asn Phe Val Asn Pro Ala Gly
            180                 185                 190

Leu His Ser Met Gly Asp Asn Leu Phe Ser Ile Thr Asn Ala Ser Gly
        195                 200                 205

Asp Ala Ile Val Gly Asn Pro Asp Ser Gln Gly Leu Gly Lys Leu Arg
    210                 215                 220

Gln Gly Phe Leu Glu Leu Ser Asn Val Arg Leu Val Glu Glu Met Thr
225                 230                 235                 240

Asp Leu Ile Thr Ala Gln Arg Ala Tyr Glu Ala Asn Ser Lys Ser Ile
                245                 250                 255

Gln Thr Ala Asp Ala Met Leu Gln Thr Val Asn Ser Leu Lys Arg
            260                 265                 270

(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...90

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178



Met Gln Asn Gly Tyr Tyr Ala Ala Thr Gly Ala Met Ala Thr Gln Phe
1               5                   10                  15

Asn Arg Leu Asp Leu Thr Ser Asn Asn Leu Ala Asn Leu Asn Thr Asn
            20                  25                  30

Gly Phe Lys Arg Asp Asp Ala Ile Thr Gly Asp Phe Leu Arg Leu Tyr
        35                  40                  45

Gln Glu Tyr Arg Glu Gln Leu Pro Leu Glu Asp Gln Thr Lys Ala Ser
    50                  55                  60

Ala Lys Tyr Leu Asn Arg Xaa Leu Asn Arg Val Pro Ile Leu Ser Xaa
65                  70                  75                  80

Ile Tyr Thr Xaa Arg Xaa Leu Gly Xaa Val
                85                  90














(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 195 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...195

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179



Val 


Gly 


Ala 


Met 


Pro 


Thr 


He 


Gin 


He 


Ar a 


Xaa 


Phe 


Gly 


Ala 


Gly 


Gly 


1 








5 










10~ 










15 




Ser 


Gly 


His 


Ser 


Asp 


Ala 


Thr 


Leu 


Met 


Leu 


Val 


Asn 


Gly 


He 


Pro 


Val 








20 










25 










30 






Tyr 


Met 


Ala 


Pro 


Tyr 


Ala 


His 


He 


Glu 


Leu 


Asp 


He 


Phe 


Pro 


Val 


Thr 






35 










40 










45 








Phe 


Gin 


Ala 


lie 


Asp 


Arg 


He 


Asp 


Val 


He 


Lys 


Gly 


Gly 


Gly 


Ser 


Val 




50 










55 










60 










G lii 


Tyr 




:i*v 


Asn 


Thr 


Tyr 


<— i . . 

uly 


Gly 


j. i. e 


Val 




He 


I 1 cr 


Thr 


Ly 


65 










70 










75 










80 


Pro 


He 


Pro 


Asn 


Gin 


Trp 


Glu 


Asn 


Gin 


Ala 


Ala 


Glu 


Arg 


Xaa 


Thr 


Tyr 










85 










90 










95 


Trp 


Ala 


Lys 


Ala 


Arg 


Asn 


Ala 


Gly 


Phe 


Ala 


Ala 


Pro 


Xaa 


Asr> 


Lys 


Thr 








100 










105 










no 






Gly 


Asp 


Pro 


Ser 


Phe 


He 


Lys 


Ser 


Leu 


Gly 


Asn 


Asn 


Leu 


Leu 


Tyr 


Asn 






115 










120 










125 








Thr 


Tyr 


Val 


Arg 


Ser 


Gly 


Gly 


Met 


He 


Asn 


Lys 


His 


Val 


Gly 


He 


Gin 




130 










135 










140 










Arg 


Lys 


Leu 


Thr 


Gly 


Leu 


Glu 


Ala 


Lys 


Ala 


Leu 


Gly 


Thr 


He 


Ala 


Pro 


145 










150 










155 










160 


Leu 


Val 


Phe 


Gin 


Thr 


He 


Gly 


Trp 


Met 


Gly 


Ser 


Met 


Thr 


Ser 


Met 


Lys 










165 










170 










175 




Ala 


Met 


Gly 


Leu 


Lys 


Pro 


He 


Thr 


Asn 


Thr 


Thr 


He 


Leu 


Ala 


He 


Xaa 



180 185 190 



Gln Pro Gly
195 

(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 84 amino acids
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...84



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 



Met Arg Xaa Glu Lys Ile Met Thr Asn Phe Glu Lys Xaa Ile Ala Gln
1               5                   10                  15

Asn Arg Leu Lys Thr Asn Ala Val Leu Thr Thr Tyr Cys Ala Ile Phe
            20                  25                  30

Ala Phe Ile Gly Leu Leu Val Asp Ala Ile Arg Ile Asn Ala Asn Asp
        35                  40                  45

Leu Gly Ile Ala Leu Phe Lys Leu Met Thr Phe Gln Ile Phe Pro Thr
    50                  55                  60

Xaa Thr Ile Val Met Phe Val Val Ala Phe Val Ile Xaa Xaa Ser Leu
65                  70                  75                  80

Tyr Pro Lys Phe

(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...76



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 



Met Xaa Met Ser His Ile Ile Lys Ser Ile Glu Ala Leu Asp Asp Tyr
1               5                   10                  15

Thr Ile Arg Phe Thr Leu Asn Gly Pro Glu Ala Pro Phe Leu Ala Asn
            20                  25                  30

Leu Gly Met Asp Phe Leu Ser Ile Leu Ser Lys Asp Tyr Ala Asp Tyr
        35                  40                  45

Leu Ala Gln Asn Asn Lys Lys Asp Glu Leu Ala Lys Xaa Pro Val Gly
    50                  55                  60

Thr Gly Pro Phe Lys Phe Phe Leu Trp Asn Lys Arg
65                  70                  75












(2) INFORMATION FOR SEQ ID NO: 182:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 196 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...196



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 



Leu Met Arg Lys Ile Phe Ser Tyr Ile Ser Lys Val Leu Leu Phe Ile
1               5                   10                  15

Gly Val Val Tyr Ala Glu Pro Asp Ser Lys Val Glu Ala Leu Glu Gly
            20                  25                  30

Arg Lys Gln Glu Ser Ser Leu Asp Lys Lys Ile Arg Gln Glu Leu Lys
        35                  40                  45

Ser Lys Glu Leu Lys Asn Lys Glu Leu Lys Asn Lys Asp Leu Lys Asn
    50                  55                  60

Lys Glu Glu Lys Lys Glu Thr Lys Ala Lys Arg Lys Pro Arg Ala Glu
65                  70                  75                  80

Val His His Gly Asp Ala Lys Asn Pro Thr Pro Lys Ile Thr Pro Pro
                85                  90                  95

Lys Ile Lys Gly Ser Ser Lys Gly Val Gln Asn Gln Gly Val Gln Asn
            100                 105                 110

Asn Ala Pro Lys Pro Glu Glu Lys Asp Thr Thr Pro Gln Ala Thr Glu
        115                 120                 125

Lys Asn Lys Glu Thr Ser Pro Ser Ser Gln Phe Asn Ser Ile Phe Gly
    130                 135                 140

Asn Pro Asn Asn Ala Thr Asn Asn Thr Leu Glu Asp Lys Val Val Gly
145                 150                 155                 160

Gly Ile Ser Leu Leu Val Asn Gly Ser Pro Ile Thr Leu Tyr Gln Ile
                165                 170                 175

Gln Glu Glu Gln Glu Lys Ser Lys Val Ser Xaa Ala Xaa Ala Arg Asp
            180                 185                 190

Arg Leu Xaa Xaa
        195






















(2) INFORMATION FOR SEQ ID NO: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 179 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...179



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 



Met Lys Arg Ser Ser Val Phe Ser Phe Leu Val Ala Phe Leu Leu Val
1               5                   10                  15

Val Gly Cys Ser His Lys Met Asp Asn Lys Thr Val Ala Gly Asp Val
            20                  25                  30

Ser Thr Lys Ala Val Gln Thr Ala Pro Val Thr Thr Glu Pro Ala Pro
        35                  40                  45

Glu Lys Glu Glu Pro Lys Gln Glu Pro Ala Pro Val Val Glu Glu Lys
    50                  55                  60

Pro Ala Ile Glu Ser Gly Thr Ile Ile Ala Ser Ile Tyr Phe Asp Phe
65                  70                  75                  80

Asp Lys Tyr Glu Ile Lys Glu Ser Asp Gln Glu Thr Leu Asp Glu Ile
                85                  90                  95

Val Gln Lys Ala Lys Glu Asn His Met Gln Val Leu Leu Glu Gly Asn
            100                 105                 110

Thr Asp Glu Phe Gly Ser Ser Glu Tyr Asn Gln Ala Leu Gly Val Lys
        115                 120                 125

Arg Thr Leu Ser Val Lys Asn Ala Leu Val Ile Lys Gly Val Glu Lys
    130                 135                 140

Asp Met Ile Lys Thr Ile Ser Phe Gly Glu Ser Lys Pro Lys Cys Val
145                 150                 155                 160

Gln Lys Thr Arg Glu Cys Tyr Arg Glu Asn Arg Arg Val Asp Val Lys
                165                 170                 175

Leu Val Lys



(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 286 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...286

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184










Met 




Thr 


Leu 


He 


Glu 


Lys 


Trp 


Phe 


Glv 


Phe 


Ser 


Gin 


He 


Arg 


Glu 










5 










10 










15 




Glu 


Leu 


Glu 


Ala 


Arg 


He 


Ser 


Glu 


Leu 


Glu 


Asp 


Glu 


Asn 


Thr 


Glu 


Leu 








20 








25 










30 






Leu 


Arg 


Glu 


Arg 


Glu 


Tyr 


Leu 


Ala 


Ala 


Glu 


Thr 


Ser 


Glu 


Leu 


Lys 


Asp 






35 










40 










45 








r\ x a 


Asn 


Asp 


Gin 


Leu 


Arg 


Gin 


Lys 


Asn 


Asp 


Lys 


Leu 


Phe 


He 


Thr 


Lys 




SO 










55 










60 










Asp 


Lys 


Leu 


Thr 


Lys 


Glu 


Asn 


Thr 


Glu 


Leu 


Phe 


Ala 


Glu 


Asn 


Glu 


Ser 


£ t: 

O J 








70 










75 










80 


Leu 




Val 


Lys 


He 


Ser 


Gly 


Leu 


Glu 


K i s 


Ser 


Asn 


Asp 


Gin 


Leu 


* - t-* 










85 










9 0 










95 








n3n 


n5H 


uys 




Thr 


Lys 


Glu 


uy ^ 


Ai. at 


ulu 


i-ieiU 


L *£3 


Thx 


Gi u 








100 










105 










110 








Asp 


lie 


Leu 


Ala 


Lys 


Glu 


Asn 


Thr 


M.X 9 


Leu 


Leu 


Ala 


Ala 




Asd 






115 










120 










12 5 








Arg 


Leu 


Thr 


Glu 


Glu 


Lys 


Arg 


Glu 


Leu 


X ilX 


Thr 


Glu 


Lys 


V3 X Ul 


Ar g 






Tin 










135 










140 










Lys 


Arg 


Glu 


Asn 


Thr 


Glu 


Leu 


Thr 


His 


Lys 


He 


Thr 


L; _L U 


Leu 


i nr 


LyS 


14 5 










150 










155 










x O U 


Glu 


Asn 


Lys 


Ala 


Leu 


Thr 


Thr 


Glu 


Asn 


Asp 


Lys 


Leu 


Asn 




pi n 

V3 X I 1 


\7 T) "1 
vUl 








165 










X / U 










17c. 




i nr 




Leu 


Thr 


Asn 


Glu 


Arg 


Asp 


Ser 


Leu 


Glu 


Gin 




Arg 


Ala 


Arg 








180 










185 










1 OA 






Leu 


Gin 


Asp 


Ala 


His 


Gly 


Phe 


Leu 


Glu 


Lys 


Arg 


Cys 


Thr 


Asn 


Leu 


Glu 






195 










200 










205 








Lys 


Glu 


Asn 


Gin 


Arg 


Leu 


Thr 


Asp 


Lys 


Leu 


Lys 


Gin 


Leu 


Glu 


Ser 


Ala 




210 










215 










220 










Gin 


Lys 


Ser 


Leu 


Glu 


Asn 


Thr 


Asn 


Asn 


Gin 


Leu 


Arg 


Gin 


Ala 


Leu 


Glu 


225 








230 










235 










240 


Asn 


Ser 


Asn 


Val 


Gin 


Leu 


Ala 


Gin 


Ala 


Lys 


Glu 


Xaa 


He 


Ala 


He 


Glu 










245 










250 










255 




Xaa 


Ser 


Glu 


Leu 


Xaa 


Arg 


Arg 


Asn 


Arg 


Thr 


Leu 


Glu 


Glu 


Leu 


Arg 


Gly 








260 










265 










270 






Tyr 


Gly 


Ser 


Gin 


Lys 


Xaa 


He 


Trp 


Thr 


Tyr 


Thr 


Xaa 


Gly 


Val 










275 










280 










285 









(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...110

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 

Val Leu Arg Lys Leu Leu Gly Lys Asn Cys Ile Glu Thr His Lys Gly
1               5                   10                  15

Val Gly Tyr Arg Leu Thr His Tyr Glu Lys Lys Ser Leu Lys Leu Phe
            20                  25                  30

Leu Gly Thr Tyr Leu Gly Ser Ser Phe Val Leu Met Leu Val Ile Ser
        35                  40                  45

Val Leu Ala Phe Asn Tyr Glu Lys Asn Glu Lys Ile Lys Xaa Ile Arg
    50                  55                  60

Met Asp Met Asp Lys Met Ala Ser Lys Ile Ala Ser Glu Ile Ile Gln
65                  70                  75                  80

Leu His Met Gln Thr His Ala Asp Tyr His Asn Ala Leu Asn Ala Leu
                85                  90                  95

Ile Ser Arg Tyr Lys Asp Val Ser Ile Xaa Leu Xaa Asp Thr
            100                 105                 110

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...124

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186










Leu Met Thr Lys Ser Leu Lys Leu Ile Gln Lys Gly Val Lys Asn Leu
1               5                   10                  15

Tyr Glu Thr Leu Lys Asn Arg Ala Leu Glu His Gln Asp Thr Leu Met
            20                  25                  30

Val Gly Ara Ser His Gly Val Phe Gly Glu Pro Ile Thr Phe Gly Leu
        35                  40                  45

Val Leu Ala Leu Phe Ala Asp Glu Ile Lys Arg His Leu Lys Ala Leu
    50                  55                  60

Asp Leu Thr Met Glu Phe Ile Xaa Val Gly Ala Ile Ser Gly Ala Met
65                  70                  75                  80

Gly Asn Phe Ala His Ala Pro Leu Glu Leu Glu Glu Leu Ala Cys Gly
                85                  90                  95

Phe Leu Gly Leu Lys Thr Ala Asn Ile Ser Asn Gln Val Ile Gln Arg
            100                 105                 110

Asp Arg Tyr Ala Gly Leu His Ala Ile Trp Leu Phe
        115                 120


















(2) INFORMATION FOR SEQ ID NO: 187:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 95 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...95

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

Leu Ser Asp Ala Ser Lys Arg Ser Leu Asn Pro Thr Leu Met Met Asn
1               5                   10                  15

Asn Asn Asn Thr Leu Pro Lys Pro Leu Glu Glu Ser Leu Asp Leu Lys
            20                  25                  30

Glu Phe Ile Ala Leu Phe Lys Thr Phe Phe Ala Lys Glu Arg Gly Ser
        35                  40                  45

Ile Ala Leu Glu Asn Asp Leu Lys Gln Ala Phe Thr Tyr Leu Asn Glu
    50                  55                  60

Val Asp Ala Ile Gly Leu Pro Ala Pro Xaa Lys Arg Glu Arg Lys Arg
65                  70                  75                  80

Ser Tyr Cys Cys Gln Thr His Gln Ile Arg Asp Ala Pro Phe Arg
                85                  90                  95




(2) INFORMATION FOR SEQ ID NO: 188:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...80



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188 



Leu Pro Ile Ile Leu Xaa Val Ile Val Met Met Phe Phe Ser Lys Ile
1               5                   10                  15

Val Gly Asp Phe Ile Glu Lys His Tyr Arg Val Lys Thr Leu Ala Phe
            20                  25                  30

Val Phe Leu Leu Val Val Gly Val Phe Leu Phe Leu Glu Gly Leu His
        35                  40                  45

Leu His Ile Asn Lys Asn Tyr Leu Tyr Ala Gly Ile Gly Phe Ala Leu
    50                  55                  60

Leu Ile Glu Cys Leu Xaa Ile Phe Ile Glu Lys Lys Met Lys Lys Ser
65                  70                  75                  80


(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...265



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 



Met 


He 


Lys 


Ala 


Arg 


Phe 


Lys 


Lys 


Arg 


Leu 


Leu 


Gly 


Ser 


Arg 


Gly 


Ala 


x 








5 










10 










15 




Phe 


Asp 


Leu 


Asn 


lie 


Asp 


Leu 


Glu 


He 


Lys 


Glu 


Ala 


Glu 


Val 


Val 


Ala 








20 










25 










30 






Leu 


Leu 


Gly 


Glu 


Ser 


Gly 


Ala 


Gly 


Lys 


Ser 


Thr 


He 


Leu 


Arg 


He 


Leu 






35 










40 










45 








Ala 


Gly 


Leu 


Glu 


Ala 


Val 


Ser 


Ser 


Gly 


Tyr 


He 


Glu 


Ala 


Asn 


His 


Ser 




50 










55 










60 










Val 


Trp 


Leu 


Asp 


Thr 


Gin 


Lys 


Lys 


He 


Phe 


Leu 


Lys 


Pro 


Gin 


Gin 


Arg


65










7 0 










75 










80 


Lys 


He 


Gly 


Phe 


Val 


Phe 


Gin 


Asp 


Tyr 


Ala 


Leu 


Phe 


Pro 


His 


Leu 


Asn 










8 5 










90 










9 5 




Val 


Tyr 


Gin 


Asn 


He 


Ala 


Phe 


Ala 


His 


Pro 


Lys 


Asp 


Lys 


Asn 


Lys 


He 








100 










105 










110 






His 


Glu 


Val 


Leu 


Arg 


Leu 


Met 


Arg 


Leu 


Glu 


Asn 


Leu 


Ser 


Gin 


Gin 


Lys 






115 










120 










125 








He 


Pro 


Lys 


Leu 


Ser 


Gly 


Gly 


Gin 


Ala 


Gin 


Arg 


Val 


Ala 


Leu 


Ala 


Arg 




13 0 










135 










140 










Ala 


Leu 


He 


Ala 


Ala 


Lys 


A.sn 


Leu 


Leu 


Leu 


Leu 


Asp 


Glu 


Pro 


Leu 


Asn 


145 










150 










155 










160 


Ala 


Leu 


Asp 


Asn 


Ala 


Leu 


Lys 


Asn 


Glu 


Val 


Gin 


Gin 


Gly 


Leu 


Leu 


Asp 










165 










170 










175 




Phe 


He 


Lys 


Arg 


Glu 


Asn 


Leu 


Ser 


Val 


Leu 


Leu 


Val 


Ser 


His 


Asp 


Pro 








180 










185 










190 






Asn 


bxU 


xie 


I nr 


Lys 


Leu 


Ala 


Arg 


Thr 


Phe 


Leu 


Phe 


Leu 


Asn 


Asn 


Gly 






195 










200 










205 








Val 






Pre 


hzzz 


Gin 


Clu 




TV 


Leu 


Phe 






Arg 




Leu 




210 










215 










220 








Val 


Lys 


Pro 


Leu 


Phe 


Glu 


Asp 


Glu 


Asn 


Tyr 


Cys 


His 


Tyr 


Glu 


Val 


lie 


225 










230 










235 










240 


Pro 


Gin 


Thr 


He 


Ser 


Leu 


Pro 


Lys 


Asp 


Cys 


Leu 


Asn 


Pro 


Thr 


Phe 


Lys 










245 










250 










255 




Leu 


Asp 


Phe 


He 


Gin 


Asn 


Lys 


Lys 


Phe 

















260 265 
(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...64



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 



Val Lys Phe Ser Val Leu Thr Leu Phe Pro Gln Leu Ile Leu Pro Tyr
1               5                   10                  15

Phe Glu Asp Ser Ile Leu Lys Arg Ala Leu Glu Lys Asn Leu Phe Glu
            20                  25                  30

Leu Glu Val Leu Asn Leu Arg Asp Phe Ser Ala Asn Lys Tyr Gln Lys
        35                  40                  45

Ala Xaa Ser His Ala His Trp Trp Gly Cys Gly Ala Asn Phe Arg Pro
    50                  55                  60



(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...138

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 

o Lys Thr Pro Leu Val lie Lys Pro Tyr Leu Lys 

10 15 

r Glu lie Phe Ala Xaa Xaa Cys Val Giy Met Ala 

25 30 

0 Val Leu Ala Gly Tyr Ala Ser Met Giy lie Pro 
40 45 

e Ala Ala Ser Phe Met Ser Ala Pro Giy Giy Leu 

55 60 
r lie Tyr Pro Gin Asn Glu Thr lie Ser Ser His 

70 75 80 

a Glu Glu Kis Val Asn lie lie Glu Ala Xaa Ala 

SO 

r Gly Xaa His Leu Ala Leu His Val Gly Ala Met 
105 110 

1 Gly Met Val Ala Leu Val Asn Gly Leu Leu Gly 
120 125 

ie Leu Gly Met Glu His 
135 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature
(B) LOCATION 1...116

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 



Leu 


T 1 >-~ r~i 


Ar q 


Th r 

X 1 1 L 


1 












c o v 


ASp 








20 


Ser 


Val 


Xaa 


Gly 






35 




Leu 


Pro 


Tyr 


Leu 




50 






Leu 


Phe 


Ala 


Lys 


65 








Ala 


Asp 


Val 


Ser 


Xaa 


Gly 


Ala 


Ser 








100 


Leu 


Leu 


Ala 


Phe 






115 




Val 


Val 


Gly 


Gly 




130 







Val Met Asn Phe Phe Val Gly Gly Leu Ser Ile Val Cys Asn Val Val
1               5                   10                  15

Val Ile Thr Tyr Ser Ala Leu His Pro Thr Ala Pro Val Glu Gly Ala
            20                  25                  30

Glu Asp Ile Val Gln Val Ser His His Leu Thr Ser Phe Tyr Gly Pro
        35                  40                  45

Ala Thr Gly Leu Leu Phe Gly Phe Thr Tyr Leu Tyr Ala Ala Ile Asn
    50                  55                  60

Phe His Thr Phe Gly Leu Asp Trp Arg Pro Tyr Ser Trp Tyr Ser Leu
65                  70                  75                  80

Val Ala Ile Asn Thr Val Pro Ala Ala Ile Leu Ser His Tyr Ser Asp
                85                  90                  95

Met Leu Asp Asp His Lys Val Leu Gly Ile Thr Glu Gly Asp Trp Trp
            100                 105                 110

Ala Ile Ile Xaa
        115



























(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
(B) LOCATION 1...227 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 



Val Leu Leu Gly Lys His Ser Gly Ala Gly Leu Leu Ser Ala Leu Xaa 
1 5 10 15 

Ala Leu Ser Phe Gly Ser Gly Val Val Ser Ile Gln Ala Leu Glu Cys 
20 25 30 

Glu Ile Thr Ser Asn Asn Lys Pro Leu Gi u Leu Val Phe Cy a; Glu 
35 40 45 

Phe Pro Lys Lys Leu Ser Ala Phe Ala Leu Gly Met Gly Leu Glu Asn 
50 55 60 

Ile Pro Lys Asp Phe Lys Lys Trp Leu Glu Leu Ala Pro Cys Val Leu 
65 70 75 80 

Asp Ala Gly Val Phe Tyr His Lys Glu Val Leu Gln Ala Leu Glu Lys 
85 90 95 

Glu Val Ile Leu Thr Pro His Pro Lys Glu Phe Leu Ser Leu Leu Lys 
100 105 110 

Ser Val Gly Ile Asn Ile Ser Met Leu Glu Leu Leu Asp Asn Lys Leu 
115 120 125 

Glu Ile Ala Arg Asp Phe Ser Gln Lys Tyr Pro Lys Val Val Leu Leu 
130 135 140 

Leu Lys Gly Ala Asn Thr Leu Ile Ala His Gln Gly Arg Val Phe Ile 
145 150 155 160 

Asn Asn Leu Gly Ser Val Ala Leu Ala Lys Ala Gly Ser Gly Asp Val 
165 170 175 

Leu Ala Gly Leu Ile Val Ser Leu Leu Ser Gln Asn Tyr Thr Pro Leu 
180 185 190 

Xaa Ala Ala Ile Asn Ala Ser Leu Ala His Ala Leu Ala Gly Leu Xaa 
195 200 205 

Phe Lys Asn Xaa Xaa Ala Leu Thr Pro Xaa Asp Leu Ile Glu Lys Xaa 
210 215 220 

Lys Arg Leu 
225 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...109 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 

Val Xaa Leu Tyr Leu Ala Leu Thr Leu Ser Leu Gly Ile Ala Met Leu 
1 5 10 15 

Leu Val Glu Met Leu Ile Gly Asn Leu Gly Lys Lys Asp Val Val Ser 
20 25 30 

Asn Tyr Gln Ile Leu Asp Pro Lys Arg Lys Lys Tyr Tyr Pro Phe Thr 
35 40 45 

Ser Phe Phe Ile Leu Gly Gly Pro Leu Ile Leu Ser Phe Tyr Ala Val 
50 55 60 

Val Leu Gly Tid Val Leu Tyr Tyr Leu Phe Val Val Thr Phe Asp Leu 
65 70 75 80 

Pro Lys Asp Leu Xaa Gln Ala Lys Met Gln Phe Xaa Met Leu Gln Asn 
85 90 95 

Gly Ser Leu Ile Trp Pro Val Ile Asp Phe Ser Ala Cys 
100 105 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...97 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195 



Leu Thr Thr Lys Ala Cys Trp Leu Leu Arg Val Cys Cys Tyr Arg Ser 
1 5 10 15 

Leu Asn Ile Thr Ile Lys Asp Arg Thr Met Lys Thr Asn Gly His Phe 
20 25 30 

Val Ala Lys Asp Phe Ala Trp Lys Lys Cys Phe Leu Gly Ala Ser Val 
35 40 45 

Val Ala Leu Leu Val Gly Cys Ser Pro His Ile Ile Glu Thr Asn Glu 
50 55 60 

Leu Lys Leu Asn Tyr His Pro Ala Ser Glu Lys Val Gln Ala Leu Asp 
65 70 75 80 

Glu Lys Ile Leu Leu Leu Arg Pro Ala Phe Gln Tyr Ser Xaa Asn Ile 
85 90 95 

Cys 

(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

Leu Ser Glu Trp Gln Thr Phe Cys Leu Lys Asp Leu Gly Lys Ile Val 
1 5 10 15 

Gly Gly Ala Thr Pro Pro Thr Asn Asn Pro Lys Asn Tyr Gly Asn Lys 
20 25 30 

Ile Ala Trp Ile Thr Pro Lys Asp Leu Ser Thr Leu Gln Gly Arg Tyr 
35 40 45 

Ile Lys Lys Gly Ser Arg Ser Ile Ser Arg Leu Gly Phe Lys Ser Cys 
50 55 60 

Ser Cys Val Leu Leu Pro Lys His Ala XT,-, Ile Leu Phe     Arg h j. a 
65 70 75 80 

Pro Ile Gly Tyr Val Ala Ile Ala Glu Lys Arg Leu Cys Thr Asn Gln 
85 90 95 

Gly Phe Lys Ser Ile Ile Pro Asn Lys Lys Ile Tyr Phe Glu Phe Leu 
100 105 110 

Tyr Tyr Leu Leu Lys Tyr Tyr Lys Asp Asn Ile Ser Asn Ile Gly Gly 
115 120 125 

Gly Thr Thr Phe Lys Glu Val Ser Gly Ala Thr Leu Gly Ser Ile Pro 
130 135 140 

Ser 
145 

(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
(B) LOCATION 1...273 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 

Met Glu Phe Met Lys Lys Phe Val Ala Leu Gly Leu Leu Ser Ala Val 
1 5 10 15 

Leu Ser Ser Ser Leu Leu Ala Glu Gly Asp Gly Val Tyr Ile Gly Thr 
20 25 30 

Asn Tyr Gln Leu Gly Gln Ala Arg Leu Asn Ser Asn Ile Tyr Asn Thr 
35 40 45 

Gly Asp Cys Thr Gly Ser Val Val Gly Cys Pro Pro Gly Leu Thr Ala 
50 55 60 

Asn Lys His Asn Pro Gly Gly Thr Asn Ile Asn Trp His Ser Lys Tyr 
65 70 75 80 

Ala Asn Gly Ala Leu Asn Gly Phe Gly Leu Asn Val Gly Tyr Lys Lys 
85 90 95 

Phe Phe Gln Phe Lys Ser Leu Asp Met Thr Ser Lys Trp Phe Gly Phe 
100 105 110 

Arg Val Tyr Gly Leu Phe Asp Tyr Gly His Ala Asp Leu Gly Lys Gln 
115 120 125 

Val Tyr Ala Pro Asn Lys Ile Gln Leu Asp Met Val Ser Trp Gly Val 
130 135 140 

Gly Ser Asp Leu Leu Ala Asp Ile Ile Asp Lys Asp Asn Ala Ser Phe 
145 150 155 160 

Gly Ile Phe Gly Gly Val Ala Ile Gly Gly Asn Thr Trp Lys Ser Ser 
165 170 175 

Ala Ala Asn Tyr Trp Lys Glu Gln Ile Ile Glu Ala Lys Gly Pro Asp 
180 185 190 

Val Cys Thr Pro Thr Tyr Cys Asn Pro Asn Ala Pro Tyr Ser Thr Asn 
195 200 205 

Thr Ser Thr Val Ala Phe Gln Val Trp Leu Asn Phe Gly Val Arg Ala 
210 215 220 

Asn Ile Tyr Lys His Asn Gly Val Glu Phe Gly Val Arg Val Pro Leu 
225 230 235 240 

Leu Ile Asn Lys Phe Leu Ser Ala Gly Pro Asn Ala Thr Asn Leu Tyr 
245 250 255 

Tyr His Leu Lys Arg Asp Tyr Ser Leu Tyr Leu Gly Tyr Asn Tyr Thr 
260 265 270 

Phe 

(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...148 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 



65 



Val 


Gin 


He 


Val 


Val 


Val 


Phe 


Tyr 


Gly 


Leu 


Pro 


Ala 


Leu 


Gly 


Vai 






5 










10 










15 




Met 


Asp 


Pro 


lie 


Pro 


Ala 


Gly 


He 


He 


Ala 


Phe 


Ser 


Phe 


Asn 


Val 




20 










25 










30 


Val 


Pro 


Ala 


Tyr 


Ala 


Ser 


Glu 


Thr 


Leu 


Arg 


Ala 


Ser 


Phe 


Leu 


Ser 




35 










40 










45 






Gin 


Asp 


Gin 


Trp 


Asp 


Ser 


Ser 


Leu 


Ser 


Leu 


Gly 


Leu 


Asn 


Tyr 


Leu 


50 








55 










60 






Thr 




Phe 


Trp 


His 


Val 


He 


Phe 


Phe 


Gin 


Ala 


Leu 


Lys 


Val 


Ala 


Pro 






70 










75 










80 


Leu 


Ser 


Asn 


Thr 


Phe 


He 


Ser 


Leu 


Phe 


Lys 


Glu 


Thr 


Ser 


Leu 


Ala 








85 










90 








Gin 


95 


Xaa 


Val 


Val 


Thr 


He 


Ala 


Glu 


Xaa 


Phe 


Arg 


He 


Ala 


Gin 


Lys 




100 










105 










110 




He 


Val 


Ser 


Tyr 


Asp 


Phe 


Xaa 


Pro 


He 


Tyr 


Leu 


Glu 


Xaa 


Ala 


Leu 




115 






120 










125 




Val 




Tro 


Leu 


Phe 


Cys 


Leu 


Val 


Leu 


Glu 


Val 


He 


Gin 


Lys 


Arc 


Glu 


130 










135 










140 










He 


Leu 


Asn 

145 

(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...134 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 

Val Val Ala Asp Glu Val Arg Asn Leu Ala Gly Arg Thr Gln Lys Ser 
1 5 10 15 

Leu Ala Glu Ile Asn Ser Thr Ile Met Val Ile Val Gln Glu Ile Asn 
20 25 30 

Asp Val Ser Ser Gln Met Asn Leu Asn Ser Gln Lys Met Glu Arg Leu 
35 40 45 

Ser Asp Met Ser Lys Ser Val Gln Glu Thr Tyr Glu Lys Met Ser Ser 
50 55 60 

Asn Leu Ser Ser Val Val Leu Asp Ser Asn Gln Ser Met Asp Asp Tyr 
65 70 75 80 

Ala Lys Ser Gly His Gln Ile Glu Ala Met Val Ser Asp Phe Ala Glu 
85 90 95 

Val Glu Lys Val Ala Ser Lys Thr Leu Ala Asp Ser Ser Asp Ile Leu 
100 105 110 

Asn Ile Ala Thr His Val Ser Gly Thr Thr Met Asn Leu Xaa Lys Gln 
115 120 125 

Val Asn Leu Phe Lys Thr 
130 

(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
(B) LOCATION 1...133 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 

Met Asn Tyr Asp Asn Tyr Trp Asp Glu Asp Lys Pro Glu Leu Asn Ile 
1 5 10 15 

Thr Pro Leu Val Asp Val Met Leu Val Leu Leu Ala Ile Leu Met Val 
20 25 30 

Thr Thr Pro Thr Leu Thr Tyr Lys Glu Glu Ile Ala Leu Pro Ser Gly 
35 40 45 

Ser Lys Thr Ala Arg Ala Thr Gln Asp Lys Val Ile Glu Ile Arg Met 
50 55 60 

Asp Lys Asp Ala Lys Ile Tyr Ile Asp Ser Gln Thr Tyr Glu Tyr Xaa 
65 70 75 80 

Ser Phe Pro Asp Thr Phe Asn Leu Leu Ser Lys Lys Tyr Asp Lys Asp 
85 90 95 

Thr Arg Val Ser Ile Arg Ala Asp Lys Arg Leu Thr Tyr Asp Lys Val 
100 105 110 

Ile Tyr Leu Leu Lys Thr Ile Lys Glu Ala Gly Phe Leu Lys Val Ser 
115 120 125 

Leu Ile Thr Ser Pro 
130 

(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 9719098A1_l_>, 



WO 97/19098 



PCT/US96/ 18542 



182 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...71 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 

Met Pro Pro Thr Xaa Pro Gln Ala Ser Ile Leu Arg Leu Thr Leu Lys 
1 5 10 15 

Asn Pro Leu Xaa Xaa Leu Ser Arg Tyr Ser Leu Cys Leu Leu Lys Lys 
20 25 30 

Thr Arg Leu Gln Thr Thr Ser Asn Ser Ala Pro Lys Ala Cys Leu Ile 
35 40 45 

Ala Gly Leu Leu Lys Lys Ser Lys Pro Phe Ile Leu Asn Thr Leu Lys 
50 55 60 

Ile Arg Ser Leu Leu Lys Pro 
65 70 

(2) INFORMATION FOR SEQ ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...217 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 

Met Pro Val Ile Arg Val Leu Val Met Leu Ala Thr Met Met Met Lys 
1 5 10 15 

Leu Val Lys Thr Ala Lys Glu Lys Lys Val Phe Lys Asn Val Gly Ile 
20 25 30 

Ser Ile Met Gly Ile Ala Phe Trp Glu Ala Ile Lys Asp Ser Ile Lys 
35 40 45 

Lys Gln Ile Lys Lys Ser Asp Trp Ile Cys Gly Asn Val Lys Thr Ala 
50 55 60 

Asp Asp Tyr Leu Lys Thr His Pro Asn Ser Trp Phe Asn Ser Ala Ile 
65 70 75 80 

Gly Val Thr Ala Ile Thr Ala Met Leu Met Asn Val Cys Phe Ala Asp 
85 90 95 

Asp Gln Ser Lys Lys Glu Val Ala Gln Ala Gln Lys Glu Ala Glu Asn 
100 105 110 

Ala Arg Asp Arg Ala Asn Lys Ser Gly Ile Glu Leu Glu Gln Glu Glu 
115 120 125 

Gln Lys Thr Glu Gln Glu Lys Gln Lys Thr Glu Gln Glu Lys Gln Lys 
130 135 140 

Thr Glu Gln Glu Lys Gln Lys Thr Glu Gln Glu Lys Gln Lys Thr Glu 
145 150 155 160 

Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu Thr Asn Asn Gln Ile Lys 
165 170 175 

Val Glu Gln Glu Gln Gln Lys Thr Glu Gln Glu Lys Gln Lys Thr Asn 
180 185 190 

Asn Thr Gln Lys Asp Leu Val Asn Lys Ala Glu Gln Asn Cys Gln Glu 
195 200 205 

Asn His Asn Gln Phe Phe Ile Lys Asn 
210 215 

(2) INFORMATION FOR SEQ ID NO: 203: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...75 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 

Met Val Ile Ser Gly His Phe Thr Thr Tyr Ser Tyr Ile Glu Glu Phe 
1 5 10 15 

Ile Ile Gln Ile Ser Gln Phe Ser Pro Asp Ile Thr Thr Leu Met Leu 
20 25 30 

Phe Val Phe Gly Leu Ala Gly Val Val Gly Ser Phe Leu Phe Gly Arg 
35 40 45 

Leu Tyr Ala Lys Asn Ser Arg Lys Phe Ile Ala Phe Ala Met Val Leu 
50 55 60 

Val Ile Cys Pro Gln Pro Leu Ala Phe Cys Val 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...192 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 

Met Lys Ser Thr Arg Ile Gly Ser Lys Ile Val Met Met Val Cys Ala 
1 5 10 15 

Val Val Ile Val Ile Ser Ala Val Met Gly Val Ile Ile Ser Tyr Lys 
20 25 30 

Val Glu Ser Val Leu Gln Ser Gln Ala Thr Glu Leu Leu Gln Lys Lys 
35 40 45 

Ala Gln Leu Val Ser Phe Lys Ile Gln Gly Ile Met Lys Arg Ile Phe 
50 55 60 

Met Gly Ala Asn Thr Leu Glu Arg Phe Leu Ser Asp Glu Asn Gly Ala 
65 70 75 80 

Ile Asn Asp Thr Leu Lys Arg Arg Met Leu Ser Glu Phe Leu Leu Ala 
85 90 95 

Asn Pro His Val Leu Leu Val Ser Ala Ile Tyr Thr Asn Asn Asn Glu 
100 105 110 

Arg Met Ile Thr Ala Met Asn Met Asp Ser Lys Ile Ala Tyr Pro Asn 
115 120 125 

Thr Ala Leu Asn Glu Asn Met Thr Xaa Pro Ile His Ser Leu Lys Ser 
130 135 140 

Ile Thr Arg Ser Xaa Pro Tyr Tyr Lys Glu Val Asn Xaa Xaa Lys Ile 
145 150 155 160 

Tyr Xaa Xaa Xaa Ile Thr Leu Pro Leu Xaa Xaa Lys Asn Xaa Asn Xaa 
165 170 175 

Ile Xaa Xaa Leu Asn Phe Xaa Leu Asn Ile Asp Xaa Phe Leu Tyr Xaa 
180 185 190 

(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...253 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205 

Met Ala Tyr Lys Tyr Asp Arg Asp Leu Glu Phe Leu Lys Gln Leu Glu 
1 5 10 15 

Ser Ser Asp Leu Leu Asp Leu Phe Glu Val     Val Phe Gly Lys Asp 
20 25 30 

Gly Glu Lys Arg His Asn Glu Lys Leu Thr Ser Ser Ile Glu Tyr Lys 
35 40 45 

Arg His Gly Asp Asp Tyr Ala Lys Tyr Ala Glu Arg Ile Ala Glu Glu 
50 55 60 

Leu Gln Tyr Tyr Gly Ser Asn Ser Phe Ala Ser Phe Ile Lys Gly Glu 
65 70 75 80 

Gly Val Leu Tyr Lys Glu Ile Leu Cys Asp Val Cys Asp Lys Leu Lys 
85 90 95 

Val Asn Tyr Asn Lys Lys Thr Glu Thr Thr Leu Ile Glu Gln Asn Met 
100 105 110 

Leu Ser Lys Ile Leu Glu Arg Ser Leu Glu Glu Met Asp Asp Glu Glu 
115 120 125 

Val Lys Glu Met Cys Asp Glu Leu Ser Ile Lys Asn Thr Asp Asn Leu 
130 135 140 

Asn Arg Gln Ala Leu Ser Ala Ala Thr Leu Thr Leu Phe Lys Met Gly 
145 150 155 160 

Gly Phe Lys Ser Tyr Gln Leu Ala Val Ile Val Ala Asn Ala Val Ala 
165 170 175 

Lys Thr Ile Leu Gly Arg Gly Leu Ser Leu Ala Gly Asn Gln Val Leu 
180 185 190 

Thr Arg Thr Leu Ser Phe Leu Thr Gly Pro Val Gly Trp Ile Ile Thr 
195 200 205 

Gly Val Trp Thr Ala Ile Asp Ile Ala Gly Pro Ala Tyr Arg Val Thr 
210 215 220 

Gln Gln Ile Pro Ala Cys Ile Val Val Ala Thr Leu Arg Leu Lys Thr 
225 230 235 240 

Ala Asn Glu Asp Lys Lys Ser Leu Gln Ile Glu Ser Val 
245 250 

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 




(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...293 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 

Leu Leu Leu Phe Ile Val Val Ile Thr Ser Leu Val Lys Asn Thr Ile 
1 5 10 15 

Pro Asn Ile Trp Leu Thr Lys Ile Leu Tyr Met Ala Ile Leu Leu Cys 
20 25 30 

Ala Ile Ala His Ser Val Gly Xaa Ile Leu Arg Trp Tyr Val Ser Gly 
35 40 45 

His Ser Pro Trp Ser Asn Ala Tyr Glu Ser Met Phe Tyr Ile Ala Trp 
50 55 60 

Ala Ser Val Ile Ala Gly Phe Val Leu Arg Xaa Lys Leu Ala Leu Ser 
65 70 75 80 

Ala Ser Ser Phe Leu Ala Gly Ile Ala Leu Phe Val Ala His Leu Gly 
85 90 95 

Phe Met Asp Pro Gln Ile Gly Pro Leu Val Pro Val Leu Lys Ser Tyr 
100 105 110 

Trp Leu Asn Ile His Val Ser Val Ile Thr Ala Ser Tyr Gly Phe Leu 
115 120 125 

Gly Leu Cys Phe Val Leu Gly Ile Leu Ser Leu Val Leu Phe Ile Leu 
130 135 140 

Arg Lys Gln Gly Arg Phe Asn Leu Asp Lys Thr Ile Leu Ser Ile Ser 
145 150 155 160 

Ala Ile Asn Glu Met Ser Met Ile Leu Gly Leu Phe Met Leu Thr Ala 
165 170 175 

Gly Asn Phe Leu Gly Gly Val Trp Ala Asn Glu Ser Trp Gly Arg Tyr 
180 185 190 

Trp Gly Trp Asp Pro Lys Glu Thr Trp Ala Leu Ile Ser Ile Cys Val 
195 200 205 

Tyr Ala Leu Ile Leu His Leu Arg Phe Leu Gly Ser Gln Asn Trp Pro 
210 215 220 

Phe Ile Leu Ala Ser Ser Ser Val Leu Gly Phe Tyr Ser Val Leu Met 
225 230 235 240 

Thr Leu Phe Trp Arg Glu Leu Leu Pro Phe Trp Leu Ala Gln Leu Cys 
245 250 255 

Arg Arg Xaa Ser Phe Ala Asp Pro Tyr Phe Phe Ile Leu Phe Gly Ser 
260 265 270 

Asp Thr Phe Arg Ser Arg Ile Leu Ala Tyr Phe Lys Arg His Leu Ser 
275 280 285 

Leu Pro Lys Leu Val 
290 

(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...142 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 

Val Glu Met Ile His Thr Gln Asp Tyr Ile Lys Met Glu Glu Ala Ala 
1 5 10 15 








Vj J. li 


,rt X a. 


Tip 
lie 


T \/<= 

LjV O 


Ar q 


Lys 


Glu 


Ser 


















25 


jl 1. e 


Leu 


Lys 


Asn 




Mid 


Asp 


Ala 


Leu 
















40 






Tnr 




p l \/ 
uiy 


Leu 




1 111. 


Lte?U 


Ar g 














5 5 






Glu 


Arc 


Pro 


Ala 


He 


Cys 


Thr 


Leu 


Met 


65 










70 








Ser 


Val 


Leu 


Leu 


ASD 


Ala 


Gl v 


Ala 


Asn 










85' 










Leu 


He 


Asp 


Phe 


Ala 


Leu 


Met 


Gly 


Tyr 








100 










105 


His 


Tyr 


Asp 


Ser 


Pro 


Lys 


Val 


Gly 


Leu 






115 










120 




He 


Lys 


Gly 


Gly 


He 


Arg 


Ser 


Leu 


Lys 




130 










135 








(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 






Ser 


He 


1 jr - 


Leu 


^ — j 


Met 


Asp 










30 






lie 


Ser 


Ala 


Gly 


His 


Ser 


Glv 








4 5 








Leu 


Gl v 


Arg 


He 


Lys 


Glv 


Val 






60 










Pro 


Ser 


Val 


Gly 


Lys 


Arg 


Pro 




75 










80 


Thr 


Asp 


Cys 


Lys 


Pro 


Glu 


Tyr 


90 










95 




Glu 


Tyr 


Ala 


Lys 


Ser 


Val 


Leu 










110 






Leu 


Ser 


Asn 


Glv 


Glu 


Glu 


Asp 








125 








Lys 


Arg 


He 


Lys 


Cys 







140 



ds 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...144 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 

Met Leu Glu Ile Lys Asn Leu Asn Cys Val Leu Asn Ser His Phe Ser 
1 5 10 15 

Leu Gln Asn Ile Asn Ile Ser Leu Ser Tyr Ser Glu Arg Val Ala Ile 
20 25 30 

Val Gly Glu Ser Gly Ser Gly Lys Ser Ser Ile Ala Asn Leu Val Met 
35 40 45 

Arg Leu Asn Pro Arg Phe Lys Ser His Asn Gly Glu Ile Leu Phe Glu 
50 55 60 

Thr Thr Asn Leu Leu Lys Glu Ser Glu Ala Phe Xaa Gln His Leu Arg 
65 70 75 80 

Gly Asn Ile Ile Ala Tyr Ile Ala Gln Asp Pro Leu Ser Ser Leu Asn 
85 90 95 

Pro Leu His Lys Ile Gly Lys Gln Met Ser Glu Ala Tyr Phe Leu His 
100 105 110 

His Lys Asn Ala Ser Gln Val Ser Leu Asn Glu Gln Val Leu Asn Val 
115 120 125 

Met Lys Gln Val Gln Leu Asp Glu Asn Phe Trp Asn Val Ser Leu Met 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 






(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
(B) LOCATION 1...83 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 

Met Asn Tyr Lys Val Ala Ser Ala Arg Asn Ile Ala Thr Leu Leu Phe 
1 5 10 15 

Leu Phe Phe Ser Gln Ser Glu Ala Phe Asp Leu Gly Lys Ile Ala Lys 
20 25 30 

Ile Lys Ala Gly Ala Glu Ser Phe Ser Lys Val Gly Phe Asn Asn Lys 
35 40 45 

Pro Ile Asn Xaa Asn Lys Gly Ile Tyr Pro Thr Glu Thr Phe Met Thr 
50 55 60 

Ile Asn Gly Leu His Ala Gly Gly Phe Tyr Gly Ala Leu Ala Gln Lys 
65 70 75 80 

Arg Tyr Gly 

(2) INFORMATION FOR SEQ ID NO: 210: 

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 
(B) TYPE: amino acid
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 130 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 



Met Asp Ala Leu Glu Ile Thr Gln Lys Leu Ile Ser Tyr Pro Thr Ile
1               5                   10                  15

Thr Pro Lys Glu Cys Gly Ile Phe Glu Tyr Ile Lys Ser Leu Phe Pro
            20                  25                  30

Ala Phe Lys Thr Leu Glu Cys Glu Lys Asn Gly Val Lys Asn Leu Phe
        35                  40                  45

Leu Tyr Arg Ile Phe Asn Pro Leu Lys Lys His Ala Glu Lys Glu His
    50                  55                  60

Ala Lys Glu Lys His Val Lys Glu Asn Val Xaa Pro Leu His Phe Cys
65                  70                  75                  80

Xaa Ala Gly His Ile Xaa Val Val Pro Pro Gly Xaa Xaa Xaa Xaa Xaa
                85                  90                  95

Asp Ser Phe Xaa Xaa Ile Ile Lys Glu Gly Phe Leu Tyr Gly Arg Gly
            100                 105                 110

Ala Gln Asp Met Lys Gly Gly Val Gly Xaa Phe Xaa Arg Cys Xaa Xaa
        115                 120                 125

Lys Phe
    130

(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES






(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 340

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 

Met Ile Leu Ser Ile Glu Ser Ser Cys Asp Asp Ser Ser Leu Ala Leu
1               5                   10                  15

Thr Arg Ile Glu Asp Ala Lys Leu Ile Ala His Phe Lys Ile Ser Gln
            20                  25                  30

Glu Lys His His Ser Ser Tyr Gly Gly Val Val Pro Glu Ile Ala Ser
        35                  40                  45

Arg Leu His Ala Glu Asn Leu Pro Leu Leu Leu Glu Arg Val Lys Ile
    50                  55                  60

Ser Leu Asn Lys Asp Phe Ser Lys Ile Lys Ala Ile Ala Ile Thr Asn
65                  70                  75                  80

Gln Pro Gly Leu Ser Val Thr Leu Ile Glu Gly Leu Met Met Ala Lys
                85                  90                  95

Ala Leu Ser Leu Ser Leu Asn Leu Pro Leu Ile Leu Glu Asp His Leu
            100                 105                 110

Arg Gly His Val Tyr Ser Leu Phe Ile Asn Glu Lys Gln Thr Arg Met
        115                 120                 125

Pro Leu Ser Val Leu Leu Val Ser Gly Gly His Ser Leu Ile Leu Glu
    130                 135                 140

Ala Arg Asp Tyr Glu Asp Ile Lys Ile Val Ala Thr Ser Leu Asp Asp
145                 150                 155                 160

Ser Phe Gly Glu Ser Phe Asp Lys Val Ser Lys Met Leu Asp Leu Gly
                165                 170                 175

Tyr Pro Gly Gly Pro Ile Val Glu Lys Leu Ala Leu Asp Tyr Ala His
            180                 185                 190

Pro Asn Glu Pro Leu Met Phe Pro Ile Pro Leu Lys Asn Ser Pro Asn
        195                 200                 205

Leu Ala Phe Ser Phe Ser Gly Leu Lys Asn Ala Val Arg Leu Glu Val
    210                 215                 220

Glu Lys Asn Ala His Asn Leu Asn Asp Glu Val Lys Gln Lys Ile Gly
225                 230                 235                 240

Tyr His Phe Gln Ser Ala Ala Ile Glu His Leu Ile Gln Gln Thr Lys
                245                 250                 255

Arg Tyr Phe Lys Ile Lys Arg Pro Lys Ile Phe Gly Ile Val Gly Gly
            260                 265                 270

Ala Ser Gln Asn Leu Ala Leu Arg Lys Ala Phe Glu Asp Leu Cys Ala
        275                 280                 285

Glu Phe Asp Cys Glu Leu Val Leu Ala Pro Leu Glu Phe Cys Ser Asp
    290                 295                 300

Asn Ala Ala Met Ile Gly Arg Ser Ser Leu Glu Ala Tyr Gln Lys Lys
305                 310                 315                 320

Arg Phe Ile Pro Leu Glu Lys Ala Asp Ile Ser Pro Arg Thr Leu Leu
                325                 330                 335

Lys Asn Phe Glu
            340

(2) INFORMATION FOR SEQ ID NO:212: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 




(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 168

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212



Met Leu Ser Ser Asn Asp Leu Phe Met Val Val Leu Gly Ala Ile Leu
1               5                   10                  15

Leu Val Leu Val Cys Leu Val Gly Tyr Leu Tyr Leu Lys Glu Lys Glu
            20                  25                  30

Phe Tyr His Lys Met Arg Arg Leu Glu Lys Thr Leu Asp Glu Ser Tyr
        35                  40                  45

Gln Glu Asn Tyr Leu Tyr Ser Lys Arg Leu Arg Glu Leu Glu Gly Arg
    50                  55                  60

Leu Glu Gly Leu Ser Leu Glu Lys Ser Ala Lys Glu Asp Ser Ser Leu
65                  70                  75                  80

Lys Thr Thr Leu Ser His Leu Tyr Asn Gln Leu Gln Glu Ile Gln Lys
                85                  90                  95

Ser Met Asp Lys Glu Arg Asp Tyr Leu Glu Glu Lys Ile Ile Xaa Xaa
            100                 105                 110

Lys Thr Xaa Xaa Lys Thr Trp Gly Ile Met Pro Leu Ala Met Lys Ser
        115                 120                 125

Thr Glu Lys Gln Val Leu Lys Met Tyr Gln Glu Gly Tyr Ser Val Asp
    130                 135                 140

Ser Ile Ser Lys Glu Phe Lys Val Ser Lys Gly Glu Val Glu Phe Ile
145                 150                 155                 160

Leu Asn Met Ala Gly Leu Lys Trp
                165
























(2) INFORMATION FOR SEQ ID NO: 213:

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 121

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 



Leu Asp Pro Phe Ser His Lys Glu Asn Phe Leu Ala Val Glu Thr Phe
1               5                   10                  15

Lys Met Leu Gly Lys Thr Glu Ser Lys Asp Asn Leu Asn Trp Met Ile
            20                  25                  30

Ala Leu Ile Ile Glu Lys Asp Lys Val Tyr Glu Gln Val Gly Ser Val
        35                  40                  45

Arg Phe Val Val Val Val Ala Ser Ala Ile Met Val Leu Ala Leu Ile
    50                  55                  60

Ile Ala Ile Thr Leu Leu Met Arg Ala Ile Val Ser Asn Arg Leu Glu
65                  70                  75                  80

Val Val Ser Ser Thr Leu Ser His Phe Phe Lys Leu Leu Asn Asn Gln
                85                  90                  95

Xaa His Ser Ser Xaa Xaa Lys Leu Val Xaa Ala Arg Ser Asn Asp Glu
            100                 105                 110

Leu Gly Arg Xaa Gln Thr Xaa Asp Xaa
        115                 120



(2) INFORMATION FOR SEQ ID NO: 214: 
(i) SEQUENCE CHARACTERISTICS: 




(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 149

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214



Met 




Phe 


iyr 


Gin 


val 


iyr 


Asp 


Pro 


Leu 


ioiy 


Hi s 


He 


■i ip 






1 






5 










10 










15 




Ala Leu Val Ala Leu Ser Pro Ile Ala Leu Phe Phe Ile Ser Leu Ile
            20                  25                  30

Val Phe Lys Leu Lys Gly Tyr Ser Ala Gly Phe Leu Ser Leu Ala Leu
        35                  40                  45

Val Ser Ile Leu Ile Ala Leu Phe Val Tyr Lys Met Pro Val Gln Met
    50                  55                  60

Ala Ser Ala Ser Phe Phe Tyr Gly Phe Leu Tyr Gly Leu Trp Pro Ile
65                  70                  75                  80

Trp Ile Val Ile Ala Ala Ile Phe Leu Tyr Asn Leu Ser Val Lys Ser
                85                  90                  95

Gly Tyr Phe Glu Ile Leu Lys Glu Ser Ile Leu Ser Leu Thr Pro Asp
            100                 105                 110

His Arg Ile Leu Val Ile Leu Ile Gly Phe Cys Phe Gly Ser Phe Leu
        115                 120                 125

Ile Xaa Gly Ala Xaa Gly Phe Gly Gly Pro Val Ala Ile Thr Ala Ala
    130                 135                 140

Leu Val Ala Leu Gly
145

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 325



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215 



Met Lys Arg Ile Leu Val Ser Leu Ala Val Leu Ser His Ser Ala His
1               5                   10                  15

Ala Val Lys Thr His Asn Leu Glu Arg Val Glu Ala Ser Gly Val Ala
            20                  25                  30

Asn Asp Lys Glu Ala Pro Leu Ser Trp Arg Ser Lys Glu Val Arg Asn
        35                  40                  45

Tyr Met Gly Ser Arg Thr Val Ile Ser Asn Lys Gln Leu Thr Lys Ser
    50                  55                  60

Gly Val Ala Asn Gln Ser Ile Glu Glu Ala Leu Gln Asn Val Pro His
65                  70                  75                  80

Ile Ile Arg Asn Ser Thr Gly Ile Gly Ala Val Pro Ser Ile Ser Arg
                85                  90                  95

Gly Phe Gly Ala Gly Gly Pro Gly His Ser Asn Thr Gly Met Ile Leu
            100                 105                 110

Val Asn Gly Ile Pro Ile Tyr Val Ala Pro Tyr Val Glu Ile Gly Thr
        115                 120                 125

Val Ile Phe Pro Val Thr Phe Gln Ser Val Asp Arg Ile Ser Val Thr
    130                 135                 140

Lys Gly Gly Glu Ser Val Arg Tyr Gly Pro Asn Ala Phe Gly Gly Val
145                 150                 155                 160

Ile Asn Ile Ile Thr Lys Gly Ile Pro Thr Asn Trp Glu Ser Gln Val
                165                 170                 175

Ser Glu Arg Thr Thr Phe Trp Gly Lys Ser Glu Asn Gly Gly Phe Phe
            180                 185                 190

Asn Gln Asn Ser Lys Asn Ile Asp Lys Ser Leu Val Asn Asn Met Leu
        195                 200                 205

Phe Asn Thr Tyr Leu Arg Thr Gly Gly Met Met Asn Lys His Phe Gly
    210                 215                 220

Ile Gln Ala Gln Val Asn Trp Leu Lys Gly Gili Gly ?ht Ax y Tyr Acall
225                 230                 235                 240

Ser Pro Thr Asp Ile Gln Asn Tyr Met Leu Asp Ser Leu Tyr Gln Ile
                245                 250                 255

Asn Asp Ser Asn Lys Ile Thr Ala Phe Phe Gln Tyr Tyr Ser Tyr Phe
            260                 265                 270

Leu Thr Asp Pro Gly Ser Leu Gly Ile Ala Ala Tyr Asn Gln Asn Arg
        275                 280                 285

Phe Gln Asn Asn Arg Pro Asn Asn Asp Lys Ser Gly Arg Ala Lys Arg
    290                 295                 300

Trp Gly Ala Val Tyr Gln Asn Phe Phe Gly Asp Thr Asp Arg Val Gly
305                 310                 315                 320

Gly Gly Phe His Phe
                325



(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 252



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 



Leu Arg Ser Ile Ser Arg Ile Lys Met Leu Ser Val Tyr Glu Lys Gly
1               5                   10                  15

Asn Ala Leu Asp Lys Arg Val Leu Glu Glu Trp Leu Leu Ser Glu Asp
            20                  25                  30

Ile Leu Met Glu Asn Ala Ala Met Ala Leu Glu Arg Ala Val Leu Gln
        35                  40                  45

Asn Ala Ser Leu Gly Ala Lys Val Ile Ile Leu Cys Gly Ser Gly Asp
    50                  55                  60

Asn Gly Gly Asp Gly Tyr Thr Leu Ala Arg Arg Leu Val Gly Arg Phe
65                  70                  75                  80

Lys Thr Leu Val Phe Glu Met Lys Leu Ala Lys Ser Pro Met Cys Gln
                85                  90                  95

Leu Gln Lys Glu Arg Ala Lys Lys Val Gly Val Val Ile Lys Ala Trp
            100                 105                 110

Glu Glu Lys Asn Glu Asp Leu Glu Cys Asp Val Leu Val Asp Cys Val
        115                 120                 125

Val Gly Ser Ala Phe Lys Gly Gly Leu Glu Pro Phe Leu Asp Phe Glu
    130                 135                 140

Ser Leu Ser Gln Lys Ala Arg Phe Lys Ile Ala Cys Asp Ile Pro Ser
145                 150                 155                 160

Gly Ile Asp Ser Lys Gly Arg Val Asp Lys Arg Ala Phe Lys Xaa Gly
                165                 170                 175

Tyr Arg Leu Ser Ala Trp Ala Leu Phe Lys Ser Cys Leu Leu Ser Xaa
            180                 185                 190

Val Lys Xaa Lys Xaa Tyr Ile Xaa Xaa Leu Lys Xaa Xaa His Leu Xaa
        195                 200                 205

Phe Asn Gln Ile Tyr Glu Ile Pro Thr Xaa Thr Phe Leu Leu Glu Lys
    210                 215                 220

Gln Xaa Asp Leu Lys Leu Pro Leu Arg Asp Arg Lys Lys Arg Ser Arg
225                 230                 235                 240

Ala Arg Leu Arg Ala Cys Ala Cys Ala Phe Gly Gln
                245                 250















(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 138 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217



Met Ala Leu Asp Lys Arg Ile Trp Met His Phe Asp Leu Leu Pro Phe
1               5                   10                  15

Ile Val Phe Ile Ile Pro Leu Leu Val Val Ser Phe Leu Leu Phe Glu
            20                  25                  30

Gly Ser Ser Ala Val Leu Ser Leu Lys Gln Gly Val Tyr Tyr Ala Ile
        35                  40                  45

Phe Leu Leu Phe Trp Val Val Phe Phe Ile Pro Phe Arg Lys Leu Asp
    50                  55                  60

Ala Arg Trp Leu Phe Ala Leu Tyr Trp Ala Cys Val Ile Leu Leu Leu
65                  70                  75                  80

Val Asp Phe Met Gly Ser Ser Lys Leu Gly Ala Gln Arg Trp Leu Val
                85                  90                  95

Val Ile Pro Phe Thr Ser Ile Thr Leu Gln Pro Ser Glu Pro Lys Asn
            100                 105                 110

Arg Xaa Ser Phe Ile Val Gly Ala Phe Xaa Xaa Asn Xaa Pro Asp Xaa
        115                 120                 125

Leu Leu Arg Ala Met Ile Gly Ala Cys Phe
    130                 135




















(2) INFORMATION FOR SEQ ID NO: 218:

















(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 326 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 




(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 326



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 



Val Leu Met Ala Leu Xaa Asp Lys Arg Tyr Gly Leu Glu Ala Gly Ile
1               5                   10                  15

Lys Tyr Phe Thr Met Gly Ala Met Ala Ser Ala Phe Phe Ala Met Gly
            20                  25                  30

Ala Met Ala Phe Tyr Leu Leu Thr Gly Ser Leu Asn Leu Glu Val Ile
        35                  40                  45

Thr Leu Tyr Leu His Thr Glu Gly Ile Thr Asn Pro Met Leu Phe Ala
    50                  55                  60

Met Gly Thr Ile Phe Leu Ile Gly Ala Ile Gly Phe Lys Val Ser Leu
65                  70                  75                  80

Val Pro Phe His Thr Trp Met Pro Asp Val Tyr Glu Gly Asn Asn Pro
                85                  90                  95

Val Phe Ala Ser Tyr Ile Ser Ile Val Pro Lys Ile Ala Gly Phe Val
            100                 105                 110

Val Ala Thr Arg Leu Phe Gly Ala Phe Ile Asp Thr His Thr Ala Trp
        115                 120                 125

Val Glu Asp Ile Phe Tyr Val Leu Ile Leu Met Thr Ile Thr Ile Pro
    130                 135                 140

Asn Phe Ile Ala Leu Trp Gln Glu Asp Val Lys Arg Met Leu Ala Tyr
145                 150                 155                 160

Ser Ser Ile Ser His Ser Gly Phe Ala Leu Ala Cys Val Phe Ile His
                165                 170                 175

Thr Glu Asp Ser Gln Gln Ala Met Phe Val Tyr Trp Phe Met Phe Ala
            180                 185                 190

Phe Thr Tyr Ile Gly Ala Phe Gly Leu Leu Trp Leu Leu Lys Ser Arg
        195                 200                 205

Glu Lys Thr Trp Asp Glu Arg Tyr Asp His Pro Tyr Ser Lys Phe Asn
    210                 215                 220

Gly Leu Ile Lys Thr His Pro Leu Val Ala Ile Leu Gly Ala Ile Phe
225                 230                 235                 240

Val Phe Gly Leu Ala Gly Ile Pro Pro Phe Ser Val Phe Trp Gly Lys
                245                 250                 255

Phe Leu Ala Val Glu Ser Ala Leu Glu Ser Asn His Ile Leu Leu Ala
            260                 265                 270

Val Val Met Leu Val Asn Ser Ala Val Ala Ala Phe Tyr Tyr Phe Arg
        275                 280                 285

Trp Leu Val Ala Met Phe Phe Asn Lys Pro Leu Gln Thr Gln Ser Tyr
    290                 295                 300

Ala Lys Thr Ile Phe Thr Pro Lys Thr Pro Pro Cys Pro Phe Met Arg
305                 310                 315                 320

Ser Leu Leu Pro Trp Arg
                325






















(2) INFORMATION FOR SEQ ID NO: 219:

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 240 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 




Met Ile Asn Ser Lys Lys Ser Leu Lys Lys Gly Leu Arg Gly Phe Phe
1               5                   10                  15

Lys Ile Leu Lys Asp Arg Asn Gly Ala His Phe Ser Cys Gly Ala Thr
            20                  25                  30

Ser Gly Phe Gly Leu Glu Ile Ala Lys Ala Phe Leu Gln Lys Asn His
        35                  40                  45

Val Val Phe Gly Thr Gly Arg Arg Gln Glu Asn Leu Gln Lys Leu Gln
    50                  55                  60

Leu Ala Tyr Pro Lys Arg Phe Ile Pro Leu Cys Phe Asp Leu Gln Asn
65                  70                  75                  80

Lys Pro Glu Thr Lys Arg Ala Ile Glu Thr Ile Phe Ser Met Thr Asp
                85                  90                  95

Arg Ile Asp Ala Leu Ile Asn Asn Ala Gly Leu Ala Leu Gly Leu Asn
            100                 105                 110

Lys Ala Tyr Glu Cys Glu Leu Asp Asp Trp Glu Val Met Ile Asp Thr
        115                 120                 125

Asn Ile Lys Gly Leu Leu His Leu Thr Arg Leu Ile Leu Pro Ser Met
    130                 135                 140

Ile Glu His Asp Gln Gly Thr Ile Ile Asn Leu Gly Ser Ile Ala Gly
145                 150                 155                 160

Thr Tyr Ala Tyr Pro Gly Gly Xaa Val Tyr Gly Ala Ser Lys Ala Xaa
                165                 170                 175

Val Lys Gln Xaa Ser Xaa Asn Leu Arg Ala Asp Val Ala Gly Thr Asn
            180                 185                 190

Thr Arg Gly Arg Arg Trp Asn Pro Gly Cys Val Ala Lys Pro Lys Val
        195                 200                 205

Ser Arg Val Arg Gly Lys Gly Asp Lys Pro Lys Pro Lys Ser Gly Tyr
    210                 215                 220

Glu Lys His Pro Leu Pro Gln Thr Thr Arg Gln Gly Leu Thr Ser Gly
225                 230                 235                 240

(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 204 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 204

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220



50 

Lys ThJ 
65 



130 



Gly 


Val 


Val 


Leu 


Ser 


Lys 


Phe 


Asp 


Ser 


Asp 


Ser 


Lys 


Gly 


Gly 




5 










10 










15 




Leu 


Gly 


He 


Thr 


Tyr 


Gin 


Leu 


Gly 


Leu 


Pro 


Leu 


Arg 


Phe 


He 




20 










25 










30 


Glu 




Gly 


Glu 


Lys 


He 


Pro 


Asp 


Leu 


Asp 


Val 


Phe 


Met 


Pro 


Arg 


35 








40 










45 




Ala 


Glu 


Gly 


Arg 


Leu 


Met 


Gly 


Ala 


Gly 


Asp 


He 


He 


Ser 


Leu 








55 










60 










Ala 


Ser 


Val 


Leu 


Asn 


Pro 


Asn 


Glu 


Ala 


Lys 


Asp 


Leu 


Ser 


Lys 






70 










75 










80 


Lys 


Lys 


Gly 


Gin 


Phe 


Thr 


Phe 


Asn 


Asp 


Phe 


Leu 


Asn 


Gin 


He 


85 










90 






He 


Ser 


95 


He 


Val 


Lys 


Lys 


Leu 


Gly 


Ser 


Met 


Ser 


Ser 


Leu 


Met 




100 








105 










110 


Leu 


Glu 


Leu 


Gly 


Asn 


Met 


Ala 


Ser 


Ala 


Leu 


Lys 


Asp 


Thr 


Asp 


115 








120 










125 






Thr 


Leu 


Glu 


Val 


Lys 


Lys 


He 


Lys 


Ala 


Met 


Val 


Asn 


Ser 


Met 






135 










140 
Lys Lys Glu Arg Glu Asn Pro Glu Ile Leu Asn Gly Ser Arg Arg Lys
145                 150                 155                 160

Arg Ile Ala Leu Gly Xaa Gly Leu Glu Xaa Xaa Glu Ile Asn Arg Ile
                165                 170                 175

Ile Lys Arg Phe Asp Gln Ala Ser Lys Met Ala Lys Arg Leu Thr Asn
            180                 185                 190

Lys Lys Gly Ile Ser Asp Leu Met Asn Leu Xaa Xaa
        195                 200











(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 92 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 



Val Glu Lys Ala His Pro Asp Val Phe Asn Leu Leu Leu Gln Val Leu
1               5                   10                  15

Asp Glu Gly His Leu Thr Asp Ser Lys Gly Val Arg Val Asp Phe Lys
            20                  25                  30

Asn Thr Ile Leu Ile Leu Thr Ser Asn Val Ala Ser Gly Ala Leu Leu
        35                  40                  45

Glu Glu Asp Leu Ser Glu Ala Asp Lys Gln Lys Ala Ile Lys Glu Ser
    50                  55                  60

Leu Arg Gln Phe Phe Lys Pro Glu Phe Leu Asn Arg Leu Asp Glu Ile
65                  70                  75                  80

Ile Ser Phe Asn Ala Leu Asp Ser His Ala Ile Ile
                85                  90














(2) INFORMATION FOR SEQ ID NO: 222:

















(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 82

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 

Leu Val Phe Leu Asp Arg Arg Leu Ile Val Met Val Thr Asp Ser Lys
1               5                   10                  15

Gly Ser Arg Tyr Ile Asn Val His Ile Leu Phe Arg Gln Ile Ser Leu
            20                  25                  30

Tyr Ala Leu Leu Ser Val Val Gly Ser Leu Leu Phe Leu Gly Val Ser
        35                  40                  45

Leu Leu Val Leu Asn Lys Glu Ile Lys Asn Ile Glu Lys Gln His Ala
    50                  55                  60

Ser Leu Xaa Lys Glu Phe Glu Lys Lys Arg Glu Thr Asn Glu Aaa Leu
65                  70                  75                  80

Xaa



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 233 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_feature 

(B) LOCATION 1 ... 233 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:223 

Leu Ser Leu Met Xaa Val Leu Asn Ala Lys Glu Cys Val Xaa Pro Ile
1               5                   10                  15

Thr Arg Ser Val Lys Tyr His Gln Gln Ser Ala Glu Ile Arg Ala Leu
            20                  25                  30

Gln Leu Gln Ser Tyr Lys Met Ala Lys Met Ala Leu Asp Asn Asn Leu
        35                  40                  45

Lys Leu Val Lys Asp Lys Lys Pro Ala Val Ile Leu Asp Leu Asp Glu
    50                  55                  60

Thr Val Leu Asn Thr Phe Asp Tyr Ala Gly Tyr Leu Val Lys Asn Cys
65                  70                  75                  80

Ile Lys Tyr Thr Pro Glu Thr Trp Asp Lys Phe Glu Lys Glu Gly Ser
                85                  90                  95

Leu Thr Leu Ile Pro Gly Ala Leu Asp Phe Leu Glu Tyr Ala Asn Ser
            100                 105                 110

Lys Gly Val Lys Ile Phe Tyr Ile Ser Asn Arg Thr Gln Lys Asn Lys
        115                 120                 125

Ala Phe Thr Leu Lys Thr Leu Lys Ser Phe Lys Leu Pro Gln Val Ser
    130                 135                 140

Glu Glu Ser Val Leu Leu Lys Glu Lys Gly Lys Pro Lys Ala Val Arg
145                 150                 155                 160

Arg Glu Leu Val Ala Lys Asp Tyr Ala Ile Val Leu Gln Val Gly Asp
                165                 170                 175

Thr Leu His Asp Phe Asp Ala Ile Phe Ala Lys Asp Ala Lys Asn Ser
            180                 185                 190

Gln Glu Gln Gln Ala Lys Val Leu Gln Asn Ala Gln Lys Phe Gly Thr
        195                 200                 205

Glu Trp Ile Ile Leu Pro Asn Ser Leu Tyr Gly Thr Trp Glu Asp Gly
    210                 215                 220

Pro Ile Lys Ala Trp Gln Asn Lys Lys
225                 230

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 85 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Helicobacter pylori

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1 ... 85 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 



Met Leu Ala Ala Gly Leu Thr Leu Pro Glu Phe Gly Cys Tyr Leu Ser
1               5                   10                  15

His Tyr Leu Leu Trp Lys Glu Cys Val Lys Leu Asp Gln Pro Val Val
            20                  25                  30

Ile Leu Glu Asp Asp Val Thr Leu Glu Ser His Phe Met Gln Ala Leu
        35                  40                  45

Glu Asp Cys Leu Lys Ser Pro Phe Asp Phe Val Arg Leu Tyr Gly Cys
    50                  55                  60

Tyr Trp Tyr Tyr Gln Arg Asp Lys Ile Pro Cys Phe Ala Gln Arg Ile
65                  70                  75                  80

Cys Ile Ser Ser Leu
                85
























(2) INFORMATION FOR SEQ ID NO: 225:

















(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori

(ix) FEATURE:

(A) NAME/KEY: misc_feature
(B) LOCATION 1 ... 115



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 



Leu Ile Ala Leu Arg Val Thr Ala Trp Lys Val Xaa Ala Met Lys Arg
1               5                   10                  15

Leu His Leu Ser Val Lys Asp Ala Glu Asn Phe Asp Ala Ile Leu Arg
            20                  25                  30

Glu Arg Pro Phe Phe Lys Asp Leu Ile Glu Phe Met Val Ser Gly Pro
        35                  40                  45

Val Val Val Met Val Leu Glu Gly Lys Asp Ala Val Ala Lys Asn Arg
    50                  55                  60

Glu Leu Met Gly Ala Thr Asp Pro Lys Leu Ala Gln Lys Gly Thr Ile
65                  70                  75                  80

Arg Ala Asp Phe Ala Glu Ser Ile Asp Ala Asn Ala Val His Gly Ser
                85                  90                  95

Asp Ser Leu Glu Asn Ala His Asn Glu Ile Ala Phe Phe Phe Ala Ala
            100                 105                 110

Arg Glu Phe
        115

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 394 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(iii) HYPOTHETICAL: YES 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Helicobacter pylori 

(ix) FEATURE: 

(A) NAME /KEY : misc_f eacure 

(B) LOCATION 1 ... 394 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 6 

Leu Met Trp Leu Lys Thr Leu Thr Leu Gln Thr Leu Asn Thr Asp Lys
1 5 10 15
Ala Leu Gln Glu Phe Ser Lys Thr Met Glu Ala Phe Lys Thr Lys Leu
20 25 30
Ile Gln Ser Ala Asn Asp Val His Ser Glu Thr Ser Arg Ala Ala Ile
35 40 45
Ala Asn Asp Leu Glu Arg Leu Lys Glu His Met Ile Asn Val Ala Asn
50 55 60
Thr Ser Ile Gly Gly Glu Phe Leu Phe Gly Gly Ser Lys Val Asp Arg
65 70 75 80
Pro Pro Ile Asp Ser Asn Gly Lys Tyr His Gly Asn Gly Glu Asp Leu
85 90 95
Asn Ala Leu Ile Ser Ser Asp Asn Leu Val Pro Tyr Asn Ile Ser Gly
100 105 110
Gln Asp Leu Phe Leu Gly Thr Asp Lys Asp Lys His Lys Leu Ile Thr
115 120 125
Thr Asn Ile Lys Leu Leu Asn Gln Asn Lys Leu Xaa Pro Asp Val Met
130 135 140
Asp Ala Leu Glu His Ser Ser Leu Pro Glu Glu Val Phe Ile Lys Pro
145 150 155 160
Ser Asp Thr Leu Arg Glu Leu Ile Gly Asp Asn Asp Lys Asn Pro Thr
165 170 175
Asn Asp Pro Lys Glu Phe Phe Tyr Leu Gln Gly Ile Arg Pro Asp Gly
180 185 190
Ser Ser Phe Lys Glu Lys Phe Ala Leu Asp Lys Ala Tyr Gln Asn Gln
195 200 205
Glu Ser Ala Thr Lys Val Ser Asp Leu Leu Asp Lys Ile Gly His Ala
210 215 220
Tyr Gly Asn Thr Ser Gln Asn Lys Val Val Asp Val Ser Leu Asn Asn
225 230 235 240
Trp Gly Gln Ile Glu Ile Lys Asn Leu Thr Pro Gly Ser Glu Asn Leu
245 250 255
Asp Phe His Leu Ile Ser Ser Asp Gly Asp Phe Asp Asp Leu Asp Ala
260 265 270
Leu Arg Ser Ser Gly Lys Arg Val Thr Glu Tyr Val Lys Ser Ala Phe
275 280 285
Val Thr Asp Arg Ser Leu Ser Gln Val Lys Ala Val Pro Asn Met Tyr
290 295 300
Asn Pro Lys Val Leu Glu Ile Pro Ser Val Phe Val Thr Lys Asp Asn
305 310 315 320
Val Leu Ala Asn Lys Asn Thr Lys Leu Ser Glu Ile Phe Gly Asp Lys
325 330 335
Val Glu Thr Leu Lys Ile Asn Ala Ser Arg Leu Gly Asp Glu Ser Ala
340 345 350
Ile Lys Ile Pro Asn Leu Pro Ile Asn Leu Asp Ile Pro Ile Leu Leu
355 360 365
Asp Val Lys Asn Ser Thr Ile Lys Asp Leu Lys Asp Ala Ile Lys Glu
370 375 380
Arg Phe Asn Asn Glu Gly Gly Cys Gly Asn
385 390

(2) INFORMATION FOR SEQ ID NO: 227:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 102 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...102
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227

Leu Lys Ala Leu Asn Asp Cys Met Val Phe Phe His Lys Lys Ile Ile
1 5 10 15
Leu Asn Phe Ile Tyr Ser Leu Met Val Ala Phe Leu Phe His Leu Ser
20 25 30
Tyr Gly Val Leu Leu Lys Ala Asp Gly Met Ala Lys Lys Gln Thr Leu
35 40 45
Leu Val Gly Glu Arg Leu Val Trp Asp Lys Leu Thr Leu Leu Gly Phe
50 55 60
Leu Glu Lys Asn His Ile Pro Gln Lys Leu Tyr Tyr Asn Leu Ser Ser
65 70 75 80
Gln Asp Lys Glu Leu Ser Ala Glu Ile Gln Ser Asn Val Thr Tyr Tyr
85 90 95
Xaa Phe Lys Arg Cys Lys
100

(2) INFORMATION FOR SEQ ID NO: 228:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 363 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...363
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228

Met Lys Phe Phe Leu Leu Lys Lys Phe Ser Xaa Phe Leu Asn Thr Gln
1 5 10 15
Thr His Phe Asn Leu Lys Arg Leu Asn Ala Ser Ser Phe Leu Leu Glu
20 25 30
Thr Phe Ser Lys Glu Lys His Ala Phe Val Val Asp Leu Ser Ala Pro
35 40 45
Tyr Ile Gly Leu Ser Lys Lys Pro Pro Glu Ser Val Leu Lys Asn Thr
50 55 60
Leu Ala Leu Asp Phe Cys Leu Asn Lys Phe Thr Lys Asn Ala Lys Ile
65 70 75 80
Leu Gln Ala Asn Val Ile Asp Asn Asp Arg Ile Leu Glu Ile Lys Gly
85 90 95
Ala Lys Asp Leu Ala Tyr Lys Ser Glu Thr Phe Ile Leu Arg Leu Glu
100 105 110
Met Ile Pro Lys Lys Ala Asn Leu Met Ile Leu Asp Gln Glu Lys Cys
115 120 125
Val Ile Glu Ala Phe Arg Phe Asn Asp Arg Val Ala Lys Asn Asp Ile
130 135 140
Leu Gly Ala Leu Pro Pro Asn Ile Tyr Glu His Gln Glu Glu Asp Leu
145 150 155 160
Asp Phe Lys Gly Leu Leu Asp Ile Leu Glu Lys Asp Phe Leu Ser Tyr
165 170 175
Gln His Lys Glu Leu Glu His Lys Lys Asn Gln Ile Ile Lys Arg Leu
180 185 190
Asn Ala Gln Lys Glu Arg Leu Lys Glu Lys Leu Glu Lys Leu Glu Asp
195 200 205
Pro Lys Thr Leu Gln Leu Glu Ala Lys Glu Leu Gln Thr Gln Ala Ser
210 215 220
Val Leu Leu Thr [illegible] [illegible] Gln His Leu Ile Asn Arg Arg Glu Asn Arg
225 230 235 240
Ile Leu Lys Asp Phe Glu Asp Lys Glu Cys Met Ile Glu Ile Asp Lys
245 250 255
Ser [illegible] [illegible] Leu Asn Ala Phe Ile Asn Lys Lys Phe Thr Leu Ser Lys
260 265 270
Leu Lys [illegible] Lys Gln Lys Ser Gln Phe Leu Tyr Leu Glu Glu Glu Asn
275 280 285
Val Lys Glu Lys Ile Ala Phe Lys Glu Asn Gln Ile Asn Tyr Arg Asp
290 295 300
Ala Ala Glu Glu Ser Val Leu Glu Met Phe Met Pro Val Lys Asn Ser
305 310 315 320
Lys Ile Lys Arg Pro Met Asn Gly Tyr Glu Val Leu Tyr Tyr Lys [illegible]
325 330 335
Xaa Lys Xaa Gly Leu Gly Lys Thr Lys Lys Arg Ile Ser Ser Phe Tyr
340 345 350
Lys Thr Gln Xaa Arg Met Ile Xaa Gly Cys Xaa
355 360

(2) INFORMATION FOR SEQ ID NO: 229:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229

ATATCCATGG TGAGTTTGAT GA 
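
Each nucleotide entry of this form states a LENGTH and then prints the sequence in space-separated blocks of ten, sometimes followed by a running count. As a cross-check, the declared length can be compared with the bases actually printed. The following is a minimal Python sketch and is not part of the original filing; the helper names `bases_only` and `length_matches` are hypothetical and introduced only for illustration.

```python
import re


def bases_only(line: str) -> str:
    """Strip block spacing and any trailing running count from a listing line."""
    # Assumption: only letters carry sequence information on the line.
    return re.sub(r"[^A-Za-z]", "", line).upper()


def length_matches(declared: int, line: str) -> bool:
    """Compare the declared LENGTH with the number of bases actually printed."""
    return len(bases_only(line)) == declared


if __name__ == "__main__":
    seq_229 = "ATATCCATGG TGAGTTTGAT GA"  # copied from SEQ ID NO: 229
    print(len(bases_only(seq_229)), length_matches(22, seq_229))
```

A mismatch between the two values would suggest a transcription error in either the LENGTH field or the printed sequence.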

(2) INFORMATION FOR SEQ ID NO: 230:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230
ATGAATTCAA TTTTTTATTT TGCCA 25 
(2) INFORMATION FOR SEQ ID NO: 231:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231

AATTCCATGG TGGGGGCTAT G 21 

(2) INFORMATION FOR SEQ ID NO: 232:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232
ATGAATTCTC GATAGCCAAA ATC 23

(2) INFORMATION FOR SEQ ID NO: 233:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...24
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233
ATTTCCATGG TCATGTCTCA TATT 
(2) INFORMATION FOR SEQ ID NO: 234:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234
ATGAATTCCA TCTTTTATTC CAC 
(2) INFORMATION FOR SEQ ID NO: 235:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...27
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235
AACCATGGTG ATTTTAAGCA TTGAAAG 
(2) INFORMATION FOR SEQ ID NO: 236:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...28
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236
AAGAATTCCA CTCAAAATTT TTTAACAG 28

(2) INFORMATION FOR SEQ ID NO: 237:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237
GATCATCCAT ATGTTATCTT CTAAT 25

(2) INFORMATION FOR SEQ ID NO: 238:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238
TGAATTCAAC CATTTTAACC CTG 23 
(2) INFORMATION FOR SEQ ID NO: 239:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...27
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239

TATACCATGG TGAAATTTTT TCTTTTA 

(2) INFORMATION FOR SEQ ID NO: 240:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240
AGAATTCAAT TGCGTCTTGT AAAAG 
(2) INFORMATION FOR SEQ ID NO: 241:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...25




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 
TTATGGATCC AAACCAATTA AAACT 
(2) INFORMATION FOR SEQ ID NO: 242:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242
TATCTCGAGT TATAGAGAAG GGC 
(2) INFORMATION FOR SEQ ID NO: 243:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243
ATATCCATGG TGAGTTTGAT GA
(2) INFORMATION FOR SEQ ID NO: 244:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244
ATGAATTCAA TTTTTTATTT TGCCA 
(2) INFORMATION FOR SEQ ID NO: 245:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245
AATTCCATGG CTATCCAAAT CCG 
(2) INFORMATION FOR SEQ ID NO: 246:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246
ATGAATTCGC CAAAATCGTA GTATT 
(2) INFORMATION FOR SEQ ID NO: 247:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...24
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247
GATACCATGG AATTTATGAA AAAG 24

(2) INFORMATION FOR SEQ ID NO: 248:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 25 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248
TGAATTCGAA AAAGTGTAGT TATAC 25

(2) INFORMATION FOR SEQ ID NO: 249:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249
TTGAACACTT TTGATTATGC GG 22 
(2) INFORMATION FOR SEQ ID NO: 250:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250

GGATTATGCG ATTGTTTTAC AAG 

(2) INFORMATION FOR SEQ ID NO: 251:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251
GTCTTTAGCA AAAATGGCGT C 
(2) INFORMATION FOR SEQ ID NO: 252:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252
AATGAGCGTA AGAGAGCCTT C 21
(2) INFORMATION FOR SEQ ID NO: 253:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253
CTTATGGGGG TATTGTCA 18 
(2) INFORMATION FOR SEQ ID NO: 254:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254
AGCATGTGGG TATCCAGC 18 
(2) INFORMATION FOR SEQ ID NO: 255:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255
AGGTTGTTGC CTAAAGACT 

(2) INFORMATION FOR SEQ ID NO: 256:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256
CTGCCTCCAC CTTTGATC 

(2) INFORMATION FOR SEQ ID NO: 257:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257
ACCAATATCA ATTGGCACT 

(2) INFORMATION FOR SEQ ID NO: 258:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258
ACTTGGAAAA GCTCTGCA 18 
(2) INFORMATION FOR SEQ ID NO: 259:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259

CTTGCTTGTC ATATCTAGC 19 

(2) INFORMATION FOR SEQ ID NO: 260:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260

GTTGAAGTGT TGGTGCTA 18 

(2) INFORMATION FOR SEQ ID NO: 261:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261

CAAGCAAGTG GTTTGGTTTT AG 
(2) INFORMATION FOR SEQ ID NO: 262:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262
TGGAAAGAGC AAATCATTGA AG 
(2) INFORMATION FOR SEQ ID NO: 263:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263
GCCCATAATC AAAAAGCCCA T 21
(2) INFORMATION FOR SEQ ID NO: 264:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...24
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264
CTAAAACCAA ACCACTTGCT TGTC 24

(2) INFORMATION FOR SEQ ID NO: 265:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265
GTAAAACGAC GGCCAG 16 
(2) INFORMATION FOR SEQ ID NO: 266:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 17 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266
CAGGAAACAG CTATGAC 

(2) INFORMATION FOR SEQ ID NO: 267:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267

ATCTTACCTA TCACCTCAAA T 

(2) INFORMATION FOR SEQ ID NO: 268:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
(A) NAME/KEY: misc_feature
(B) LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268
AGACAGCAAC ATCTTTGTGA A

CLAIMS

1. A substantially pure nucleic acid encoding an H. pylori polypeptide, said nucleic 
acid comprising a nucleotide sequence selected from the group consisting of SEQ ID 
NOs: 1-114.

2. A substantially pure nucleic acid from naturally occurring H. pylori which 
hybridizes under stringent conditions to a nucleic acid which encodes an H. pylori 
polypeptide, said nucleic acid comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-114.

3. A method of evaluating a compound for the ability to bind an H. pylori nucleic 
acid comprising: contacting said compound with an H. pylori nucleic acid selected from 
the group consisting of SEQ ID NO: 1-114 and determining if said compound binds said H.
pylori nucleic acid. 

4. The method of claim 3, wherein said compound is an activator of the bacterial 
life cycle. 

5. The method of claim 3, wherein said compound is an inhibitor of the bacterial 
life cycle.

6. The method of claim 3, wherein said method is performed in vitro. 

7. The method of claim 3, wherein said method is performed in vivo.

8. A method of generating a vaccine for immunizing a subject against H. pylori 
infection comprising: immunizing said subject with a nucleic acid encoding an H. pylori 
polypeptide or a fragment thereof, said nucleic acid comprising a nucleotide sequence 
selected from the group consisting of SEQ ID NO: 1-114, and a therapeutically acceptable
carrier.

9. A method of detecting the presence of a Helicobacter species in a sample 
comprising: 

contacting said sample with a nucleic acid encoding an H. pylori polypeptide, said
nucleic acid comprising a nucleotide sequence selected from the group consisting of SEQ
ID NO: 1-114;

hybridizing said sample to said nucleic acid;
said hybridization being indicative of the presence of said Helicobacter species in said
sample.

10. The method of claim 9, wherein said Helicobacter species is H. pylori.

11. The method of claim 9, wherein said nucleic acid is 20 or more nucleotides in
length.

12. A method of inhibiting expression of a gene from a Helicobacter species
comprising: administering to said species an H. pylori antisense nucleic acid selected from
the group consisting of SEQ ID NO: 1-114.

13. The method of claim 12, wherein said Helicobacter species is H. pylori. 

14. The method of claim 12, wherein said antisense nucleic acid is administered in 
a carrier.

15. The method of claim 12, wherein said carrier is a liposome or a bacteriophage. 

16. The method of claim 12, wherein said antisense nucleic acid is 20 or more 
nucleotides in length.

17. The method of claim 12, wherein said antisense nucleic acid is capable of 
binding to Helicobacter nucleic acid or mRNA. 

18. A method of generating a vaccine for immunizing a subject against H. pylori
infection comprising: immunizing said subject with a nucleic acid encoding an H. pylori
polypeptide or a fragment thereof, said polypeptide comprising an amino acid sequence
selected from the group consisting of SEQ ID NO: 115-228, and a therapeutically
acceptable carrier.